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HUMAIM NEURONAL NICOTINIC ACETYLCHOLINE RECEPTOR 
COMPOSITIONS AND METHODS EMPLOYING SAME 

RELATED APPLICATIONS 

For U.S. national purposes, this application is a continuation-in-part 
of U.S. application Serial No. 08/484,722, by Elliott etaL, entitled 
"HUMAN NEURONAL NICOTINIC ACETYLCHOLINE RECEPTOR 
5 COMPOSITIONS AND METHODS EMPLOYING SAME", filed June 7, 
1995. The subject matter of U.S. application Serial No. 08/484,722, is 
herein incorporated in its entirety by reference thereto. 

This application is also related to U.S. Patent No. 5,369,028 and 
U.S. application Serial Nos. 08/028,031, 08/149,503, 08/496,855, 

10 07/938,154, 08/467,574, 08/466,589, 08/487,596. The subject 
matter of each of these applications and U.S. Patent is herein 
incorporpated by reference thereto. 
FIELD OF INVENTION 

This invention relates to nucleic acid molecules encoding human 

15 neuronal nicotinic acetylcholine receptor protein subunits, as well as the 
encoded proteins. In particular, human neuronal nicotinic acetylcholine 
receptor a-subunit-encoding DNA and RNA, a-subunit proteins, )5-subunit- 
encoding DNA and RNA , i9-subunit proteins, and combinations^ thereof 
are provided. 

20 BACKGROUND 

Ligand-gated ion channels provide a means for communication 
between cells of the central nervous system. These channels convert a 
signal ( e.c . a chemical referred to as a neurotransmitter) that is released 
by one cell into an electrical signal that propagates along a target cell 

25 membrane. A variety of neurotransmitters and neurotransmitter receptors 
exist in the central and peripheral nervous systems. Five families of 
ligand-gated receptors, including the nicotinic acetylcholine receptors 
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(nAChRs) of neuromuscular and neuronal origins, have been identified 
(Stroud etaL 1990 Biochemistry 29:11009-110231. There is, however, 
little understanding of the manner in which the variety of receptors 
generates different responses to neurotransmitters or to other modulating 
5 ligands in different regions of the nervous system. 

The nicotinic acetylcholine receptors (nAChRs) are multisubunit 
proteins of neuromuscular and neuronal origins. These receptors form 
ligand-gated ion channels that mediate synaptic transmission between 
nerve and muscle and between neurons upon interaction with the 

10 neurotransmitter acetylcholine (ACh). Since various neuronal nicotinic 
acetylcholine receptor (nAChR) subunits exist, a variety of nAChR 
compositions (La,, combinations of subunits) exist. The different nAChR 
compositions exhibit different specificities for various ligands and are 
thereby pharmacologically distinguishable. Thus, the nicotinic 

15 acetylcholine receptors expressed at the vertebrate neuromuscular 

junction, in vertebrate sympathetic ganglia and in the vertebrate central 
nervous system have been distinguished on the basis of the effects of 
various ligands that bind to different nAChR compositions. For example, 
the elapid cr-neurotoxins that block activation of nicotinic acetylcholine 

20 receptors at the neuromuscular junction do not block activation of some 
neuronal nicotinic acetylcholine receptors that are expressed on several 
different neuron-derived cell lines. 

Muscle nAChR is a glycoprotein composed of five subunits with 
the stoichimetry (a) 2P W or €)6, Each of the subunits has a mass of 

25 about 50-60 kilodaltons (kd) and is encoded by a different gene. The 
(a)2l3(Y or complex forms functional receptors containing two ligand 
binding sites and a ligand-gated transmembrane channel. Upon 
interaction with a cholinergic agonist, muscle nicotinic nAChRs conduct 
sodium ions. The influx of sodium ions rapidly short-circuits the normal 
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ionic gradient maintained across the plasma membrane, thereby 
depolarizing the membrane. By reducing the potential difference across 
the membrane, a chemical signal is transduced into an electrical signal at 
the neuromuscular junction that induces muscle contraction. 
5 Functional muscle nicotinic acetylcholine receptors have been 

formed with apSy subunits, apy subunits, aps subunits, aSy subunits, but 
not only with one subunit (see, e.g. . Kurosaki et aL {1 987) FEBS Lett. 
214 253-258; Comacho et ah (1 993) J. Neuroscience 11:605-61 3). In 
contrast, functional neuronal nAChRs can be formed from a subunits 

10 alone or combinations of a and P subunits. The larger a subunit is 

generally believed to be a ACh-binding subunit and the lower molecular 
weight P subunit is generally believed to be the structural subunit, 
although it has not been definitely demonstrated that the P subunit does 
not have the ability to bind ACh or participate in the formation of the ACh 

15 binding site. Each of the subunits which participate in the formation of a 
functional ion channel are, to the extent they contribute to the structure 
of the resulting channel, "structural" subunits, regardless of their ability 
(or inability) to bind ACh. Neuronal nAChRs, which are also ligand-gated 
ion channels, are expressed in ganglia of the autonomic nervous system 

20 and in the central nervous system (where they mediate signal 

transmission), and in pre- and extra-synaptic locations (where they 
modulate neurotransmission and may have additional functions; 
Wonnacott et aL (1 990) In: progress in Brain Research . A. Nordberg et 
aL, Eds., Elsevier, Amsterdam) 157-163. 

25 DNA encoding nAChRs has been isolated from several sources. 

Based on the information available from such work, it has been evident 
for some time that nAChRs expressed in muscle, in autonomic ganglia, 
and in the central nervous system are functionally diverse. This 
functional diversity could be due, at least i^i part, to the large number of 
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different nAChR subunits which exist. There is an incomplete 
understanding, however, of how (and which) nAChR subunits combine to 
generate unique nAChR subtypes, particularly in neuronal cells. Indeed, 
there is evidence that only certain nAChR subtypes may be involved in 
5 disease such as Alzheimer's disease. Moreover, it is not clear whether 
nAChRs from analogous tissues or cell types are similar across species. 

Accordingly, there is a need for the isolation and characterization 
of DNAs encoding each human neuronal nAChR subunit, recombinant 
cells containing such subunits and receptors prepared therefrom. In order 

10 to study the function of human neuronal nAChRs and to obtain disease- 
specific pharmacologically active agents, there is also a need to obtain 
isolated (preferably purified) human neuronal nAChRs, and isolated 
(preferably purified) human neuronal nAChR subunits. In addition, there 
is also a need to develop assays to identify such pharmacologically active 

15 agents. 

The availability of such nucleic acids, cells, receptor subunits and 
receptor compositions will eliminate the uncertainty of . speculating as to 
human neuronal nAChR structure and function based on predictions 
drawn from non-human nAChR data, or human or non-human muscle or 
20 ganglia nAChR data. 

i 

Therefore, It is an object herein to isolate and characterize DNA 
encoding subunits of human neuronal nicotinic acetylcholine receptors. It 
is also an object herein to provide methods for recombinant production of 
human neuronal nicotinic acetylcholine receptor subunits. It is also an 
25 object herein to provide purified receptor subunits and to provide methods 
for screening compounds to identify compounds that modulate the 
activity of human neuronal nAChRs. 

These and other objects will become apparent to those of skill in 
the art upon further study of the specification and claims. 
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SUMMARY OF THE INVENTION 

Isolated nucleic acid molecules encoding human alpha (a) and 
beta ifi) subunits of neuronal nAChRs are provided. In particular, isolated 
DMA and RNA molecules encoding human and 02 subunits of neuronal 
5 nAChRs are provided. Messenger RNA and polypeptides encoded by the 
DNA are also provided. 

Recombinant human neuronal nicotinic nAChR subunits, including 
ag and subunits, as well as methods for the production thereof are also 
provided, in addition, recombinant human neuronal nicotinic 
10 acetylcholine receptors containing at least one human neuronal nicotinic 
nAChR subunit are also provided, as well as methods for the production 
thereof. Also provided are recombinant neuronal nicotinic nAChRs that 
contain a mixture of one or more nAChR subunits encoded by a host cell, 
and one or more nAChR subunits encoded by heterologous DNA or RNA 
15 ( i.e. . DNA or RNA as described herein that has been introduced into the 
host cell), as well as methods for the production thereof. 

Plasmids containing DNA encoding the above-described subunits 
are also provided. Recombinant cells containing the above-described 
DNA, mRNA or plasmids are also provided herein. Such cells are useful, 
20 for example, for replicating DNA, for producing human nAChR subunits 
and recombinant receptors, and for producing cells that express receptors 
containing one or more human subunits. 

The DNA, RNA, vectors, receptor subunits, receptor subunit 
combinations and cells provided herein permit production of selected 
25 neuronal nicotinic nAChR receptor subtypes and specific combinations 
thereof, as well as antibodies to the receptor subunits. This provides a 
means to prepare synthetic or recombinant receptors and receptor 
subunits that are substantially free of contamination from many other 
receptor proteins whose presence can inter j'ere with analysis of a single 
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nAChR subtype. The availability of desired receptor subtypes makes it 
possible to observe the effect of a drug substance on a particular receptor 
subtype and to thereby perform initial in vitro screening of the drug 
substance in a test system that is specific for humans and specific for a 
5 human neuronal nicotinic nAChR subtype. 

Also provided herein, are single-stranded probes containing 
portions of the DNA molecules described herein and antibodies that 
specifically bind to proteins encoded by the DNA. Also provided herein Is 
an isolated nucleic acid molecule containing nucleotides 98-21 1 of SEQ 
10 IDN0:15. 

Proteins encoded by the DNA are also provided. The proteins may 
be prepared by expressing the DNA in a suitable prokaryotic or eukaryotic 
host ceil and isolating the resulting protein. 

Methods for identifying functional neuronal nicotinic acetylcholine 

15 receptor subunits and combinations thereof are also provided. 

Assays for identifying compounds that modulate the activity of 
human nicotinic acetylcholine receptors are also provided. The ability to 
screen drug substances in vitro to determine the effect of the drug on 
specific receptor compositions should permit the development and 

20 screening of receptor subtype-specific or disease-specific drugp. Also, 
testing of single receptor subunits or specific receptor subtype 
combinations with a variety of potential agonists or antagonists provides 
additional information with respect to the function and activity of the 
individual subunits and should lead to the identification and design of 

25 compounds that are capable of very specific interaction with one or more 
of the receptor subunits or receptor subtypes. The resulting drugs should 
exhibit fewer unwanted side effects than drugs identified by screening 
with cells that express a variety of subtypes. 
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Further in relation to drug development and therapeutic treatment 
of various disease states, the availability of DNA and RNA encoding 
human neuronal nAChR subunits provides a means for identification of 
any alterations in such genes ( e.g. , mutations) that may correlate with 
5 the occurrence of certain disease states. In addition, the creation of 
animal models of such disease states becomes possible, by specifically 
introducing such mutations into synthetic DNA sequences which can then 
be introduced into laboratory animals or in vitro assay systems to 
determine the effects thereof. 
10 BRIEF DESCRIPTION OF FIGURES 

Figure 1 presents a restriction map of a cytomegalovirus (CMV) 
promoter-based vector pcDNA3-KEalpha6.3 that contains an (7e-encoding 
fragment as an £coRI insert. 

Figure 2 presents a restriction map of a CMV promoter-based 
15 vector pcDNA3-KBbeta3.2 that contains a i^s-encoding fragment as an 
fcoRI insert. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
Definitions 

Unless defined otherwise, all technical and scientific terms used 
20 herein have the same meaning as is commonly understood by one of skill 
in the art to which this invention belongs. All patents and publications 
referred to herein are, unless noted otherwise, incorporated by reference 
in their entirety. 

As used herein, isolated (or substantially purified or pure) as a 
25 modifier of nucleic acid molecule, DNA, RNA, polypeptides or proteins 
means that the DNA, RNA, polypeptides or proteins so-designated have 
been separated from their in vivo cellular environments through the hand 
of man. Thus, for example, as used herein, isolated (or substantially 
pure) DNA refers to DNA fragments purified according to standard 
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techniques employed by those skilled in the art (see, e^, Maniatis et aL 
(1982) Molecular Clonino: A Laboratory Manual . Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY). 

Similarly, as used herein, "recombinant" as a modifier of DNA, 
5 RNA, polypeptides or proteins means that the DNA, RNA, polypeptides or 
proteins so designated have been prepared by the efforts of human 
beings, e.g. . by cloning, recombinant expression, or such method. Thus, 
as used herein, recombinant proteins, for example, refers to proteins 
produced by a recombinant host expressing DNAs which have been 

10 added to that host through the efforts of human beings. 

As used herein, vector (or plasmid) refers to discrete elements that 
are used to introduce heterologous DNA into cells for either expression or 
replication thereof. Selection and use of such vehicles are well within the 
level of skill of the art. An expression vector includes vectors capable of 

15 expressing DNA that is operatively linked with regulatory sequences, such 
as promoter regions, that are capable of effecting expression of such 
DNA fragments. Thus, an expression vector refers to a recombinant DNA 
or RNA construct, such as plasmid, a phage, recombinant virus or other 
vector that, upon introduction to a host cell, allows expression of DNA 

20 cloned into the appropriate site on the vector. Appropriate expression 
vectors are well known to those of skill in the art and include those that 
are replicable in eukaryotic cells and/or prokaryotic cells and those that 
remain episomal or those which integrate into the host cell genome. 
Presently preferred plasmids for expression of the nAChR subunits in 

25 eukaryotic host cells, particularly mammalian cells, include, but are not 
limited to, cytomegalovirus (CMV), Simian virus 40 (SV40) and mouse 
mammary tumor virus (MMTV) promoter-containing vectors such as 
pCMV, pcDNAI, pcDNA3, pZeoSV, pCEP4, pMAMneo and pMAMhyg. 
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As used herein, a promoter region refers to a segnnent of DNA that 
controls transcription of DNA to which it is operatively linlced. The 
promoter region includes specific sequences that are sufficient for RNA 
polymerase recognition, binding and transcription initiation. This portion 
5 of the promoter region is referred to as the promoter. In addition, the 
promoter region includes sequences that modulate this recognition, 
binding and transcription initiation activity of RNA polymerase. These 
sequences may be cis acting or may be responsive to trans acting 
factors. Promoters, depending upon the nature of the regulation, may be 

10 constitutive or regulated. Exemplary promoters contemplated for use 
herein include the SV40 early promoter, the cytomegalovirus (CMV) 
promoter, the mouse mammary tumor virus (MMTV) steroid-inducible 
promoter, and Moloney murine leukemia virus (MMLV) promoter, and 
other suitable promoters. 

15 As used herein, the term "operatively linked" refers to the 

functional relationship of DNA with regulatory and effector sequences of 
nucleotides, such as promoters, enhancers, transcriptional and 
translational start and stop sites, and other signal sequences. For 
example, operative linkage of DNA to a promoter refers to the physical 

20 and functional relationship between the DNA and the promoter such that 
the transcript of such DNA is initiated from the promoter by an RNA 
polymerase that specifically recognizes, binds to and transcribes the 
DNA. In order to optimize expression and/or in vitro transcription, it may 
be necessary to remove or alter 5' untranslated portions of the clones to 

25 remove extra, potential alternative translation initiation ( i.e. , start) codons 
or other sequences that interfere with or reduce expression, either at the 
level of transcription or translation. Alternatively, consensus ribosome 
binding sites (see, for example, Kozak (1991) J. Biol. Chem. 266 :19867- 
1 9870) can be inserted immediately 5' of the start codon to enhance 
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expression. The desirability of (or need for ) such modification may be 
empirically determined. 

As used herein, expression refers to the process by which 
polynucleic acids are transcribed into mRNA and translated into peptides, 
5 polypeptides, or proteins. If the polynucleic acid is derived from genomic 
DNA, expression may, if an appropriate eukaryotic host cell or organism 
is selected, include splicing of the mRNA. 

Particularly preferred vectors for transfection of mammalian cells 
are the SV40 promoter-based expression vectors, such as pZeoSV 
10 (Invitrogen, San Diego, CA), CMV promoter-based vectors such as 
pcDNAI , PCDNA3, pCEP4 (Invitrogen, San Diego, CA), and MMTV 
promoter-based vectors such as pMAMneo (Clontech, Inc.). 

As used herein, a human alpha (a) subunit gene is a gene that 
encodes an alpha subunit of a human neuronal nicotinic acetylcholine 
15 receptor. Alpha subunits of human nAChRs typically exhibit a 

conservation of adjacent cysteine residues in the presumed extracellular 
domain of the subunit that are the homologs of cysteines 192 and 193 of 
the Torpedo alpha subunit (see Noda ej aL (1982) Nature 299:793-797> 
As used herein, an alpha subunit subtype refers to a human 
20 neuronal nAChR subunit that is encoded by DNA that hybridizes under 
high stringency conditions to at least one of the neuronal nAChR alpha 
subunit-encoding DNA clones disclosed herein. An alpha subunit 
generally binds to ACh under physiological conditions and at physiological 
concentrations and, in the optional presence of a beta subunit (Le,, some 
25 alpha subunits are functional alone, while others require the presence of a 
beta subunit), generally forms a functional oAChR as assessed by 
methods described herein or known to those of skill in this art. 

Also contemplated are alpha subunits encoded by DNA molecules 
that encode alpha subunits as defined abo\.ls, but that by virtue of 
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degeneracy of the genetic code do not necessarily hybridize to the 
disclosed DNA under specified hybridization conditions. Such subunits 
also form a functional receptor, as assessed by the methods described 
herein or known to those of skill in the art, generally with one or more 
5 beta subunit subtypes. Typically, unless an alpha subunit is encoded by 
RNA that arises from alternative splicing (Le., a splice variant), alpha- 
encoding DNA and the alpha subunit encoded thereby share substantial 
sequence homology with at least one of the alpha subunit DNAs (and 
proteins encoded thereby) described herein. It is understood that DNA or 

10 RNA encoding a splice variant may overall share less than 90% homology 
with the DNA or RNA provided herein, but include regions of nearly 
100% homology to a DNA fragment described herein, and encode an 
open reading frame that includes start and stop codons and encodes a 
functional alpha subunit. 

15 As used herein, a human beta ifi) subunit gene is a gene that 

encodes a beta subunit of a human neuronal nicotinic acetylcholine 
receptor. Assignment of the name "beta" to a putative neuronal nAChR 
subunit has been based on the lack of adjacent cysteine residues (which 
residues are characteristic of alpha subunits). The beta subunit is 

20 frequently referred to as the structural nAChR subunit (although it is 
possible that beta subunits also have ACh binding properties). 
Combination of the appropriate beta subunit(s) with appropriate alpha 
subunit(s) leads to the formation of a functional receptor. 

As used herein, a beta subunit subtype refers to a neuronal nAChR 

25 subunit that is encoded by DNA that hybridizes under high stringency 
conditions to at least one of the neuronal nAChR-encoding DNAs 
disclosed herein. A beta subunit may form a functional nAChR, as 
assessed by methods described herein or known to those of skill in this 
art, with appropriate alpha subunit subtype(s). 
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Also contemplated are beta subunits encoded by DNA that encodes 
beta subunits as defined above, but that by virtue of degeneracy of the 
genetic code do not necessarily hybridize to the disclosed DNA under the 
specified hybridization conditions. Such subunits may also form 
5 functional receptors, as assessed by the methods described herein or 
known to those of skill in the art, in combination with appropriate alpha 
subunit subtype(s). Typically, unless a beta subunit is encoded by RNA 
that arises as a splice variant, beta-encoding DNA and the beta subunit 
encoded thereby share substantial sequence homology with the beta- 
10 encoding DNA and beta subunit protein described herein. It is understood 
that DNA or RNA encoding a splice variant may share less that 90% 
overall homology with the DNA or RNA provided herein, but such DNA 
will include regions of nearly 100% homology to the DNA described 
herein. 

15 As used herein, a nAChR subtype refers to a nicotinic acetylcholine 

receptor containing a particular combination of a and/or P subunit 
subtypes, e.g., a receptor containing human nAChR and subunits. 

As used herein, a splice variant refers to variant nAChR subunit- 
encoding nucleic acid(s) produced by differential processing of primary 

20 transcript(s) of genomic DNA, resulting in the production of more than 
one type of mRNA. cDNA derived from differentially processed genomic 
DNA will encode nAChR subunits that have regions of complete amino 
acid identify and regions having different amino acid sequences. Thus, 
the same genomic sequence can lead to the production of multiple, 

25 related mRNAs and proteins. The resulting mRNA and proteins are 
referred to as "splice variants". 

As used herein, heterologous or foreign DNA and RNA are used 
interchangeably and refer to DNA or RNA that does not occur naturally as 
part of the genome in which it is present or which is found in a location 
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or locations in the genome that differ from that in which it occurs in 
nature. It is typically DNA or RNA that is not endogenous to the cell and 
has been artificially introduced into the cell. Examples of heterologous 
DNA include, but are not limited to, DNA that encodes a human nAChR 
5 subunit and DNA that encodes RNA or proteins that mediate or alter 
expression of endogenous DNA by affecting transcription, translation, or 
other regulatable biochemical processes. The cell that expresses the 
heterologous DNA, such as DNA encoding a human nAChR subunit, may 
contain DNA encoding the same or different nicotinic acetylcholine 

10 receptor subunits. The heterologous DNA need not be expressed and 
may be introduced in a manner such that It is integrated into the host cell 
genome or is maintained episomally. 

Stringency of hybridization is used herein to refer to conditions 
under which polynucleic acid hybrids are stable. As known to those of 

15 skill in the art, the stability of hybrids is reflected in the melting 

temperature (T J of the hybrids. T^ can be approximated by the formula: 

81.5*C - 16.6 (logio[Na*]) + 0.41 (%G + C) - 600/1, 

where I is the length of the hybrids in nucleotides. T^ 

decreases approximately l-I.B'^C with every 1% decrease in sequence 

20 homology. In general, the stability of a hybrid is a function of sodium ion 
concentration and temperature. Typically, the hybridization reaction is 
performed under conditions of lower stringency, followed by washes of 
varying, but higher, stringency. Reference to hybridization stringency 
relates to such washing conditions. 

25 As used herein: 

(1) HIGH STRINGENCY conditions, with respect to fragment 
hybridization, refer to conditions that permit hybridization of only those 
nucleic acid sequences that form stable hybrids in 0.01 8M NaCI at eS^'C 
(Le., if a hybrid is not stable in 0.01 8M NaCI at 65°C, it will not be stable 
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under high stringency conditions, as contemplated herein). High 
stringency conditions can be provided, for example, by hybridization in 
50% formamide, 5X Denhardt's solution, 5X SSPE, 0.2% SDS, 200 
//g/ml denaturated sonicated herring sperm DNA, at 42°C, followed by 
5 washing in 0.1 X SSPE, and 0.1% SDS at 65" C; 

(2) MODERATE STRINGENCY conditions, with respect to fragment 
hybridization, refer to conditions equivalent to hybridization in 50% 
formamide, 5X Denhardt's solution, 5X SSPE, 0.2% SDS, 200 //g/ml 
denatured sonicated herring sperm DNA, at 42*^0, followed by washing in 

10 0.2X SSPE, 0.2% SDS, at 60'C; 

(3) LOW STRINGENCY conditions, with respect to fragment 
hybridization, refer to conditions equivalent to hybridization in 1 0% 
formamide, 5X Denhardt's solution, 6X SSPE, 0.2% SDS, 200 /jg/m\ 
denatured sonicated herring sperm DNA, followed by washing in 1 X 

15 SSPE, 0.2% SDS, at 50'C; and 

(4) HIGH STRINGENCY conditions, with respect to oligonucleotide 
( i.e. . synthetic DNA < about 30 nucleotides in length) hybridization, refer 
to conditions equivalent to hybridization in 1 0% formamide, 5X 
Denhardt's solution, 6X SSPE, 0.2% SDS, 200 //g/ml denatured 

20 sonicated herring sperm DNA, at 42'C, followed by washing in 1 X SSPE, 
and 0.2% SDS at 50"C. 

It is understood that these conditions may be duplicated using a 
variety of buffers and temperatures and that they are not necessarily 
precise. 

25 Denhardt's solution and SSPE (see, e^, Sambrook et aL (1989) 

Molecular Cloning: A Laboratory ManuaL 2nd Ed., Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, NY) are well known to those of 
skill in the art as are other suitable hybridization buffers. For example, 
SSPE is pH 7.4 phosphate-buffered 0.1 8M NaCI. SSPE can be prepared. 



wo 96/41876 



PCTAJS96/09775 



-15- 

for example, as a 20X stock solution by dissolving 175.3 g of NaCi, 27.6 
g of NaH2P04 and 7.4 g EDTA in 800 ml of water, adjusting the pH to 
7.4, and then adding water to 1 liter. Denhardt's solution (see, Denhardt ^ 
(1966) Biochem. Biohphvs. Res. Commun. 23:641) can be prepared, for 
5 example, as a BOX stock solution by mixing 5 g Ficoll {Type 400, 
Pharmacia LKB Biotechnology, INC., Piscataway NJ), 5 g of 
polyvinylpyrrolidone, 5 g bovine serum albumin (Fraction V; Sigma, St. 
Louis MO) water to 500 ml and filtering to remove particulate matter. 

As used herein, the phrase "substantial sequence homology" refers 
10 to two sequences of nucleotides that share at least about 90% identity, 
and amino acid sequences which typically share greater than 95% amino 
acid identity. It is recognized, however, that proteins (and DNA or mRNA 
encoding such proteins) containing less than the above-described level of 
homology arising as splice variants or that are modified by conservative 
15 amino acid substitutions {or substitution of degenerate codons) are 
contemplated herein. 

The phrase "substantially the same" is used herein in reference to 
the nucleotide sequence of DNA, the ribonucleotide sequence of RNA, or 
the amino acid sequence or protein, that have slight and non- 
20 consequential sequence variations from the actual sequences disclosed 
herein. Species that are substantially the same are considered to be 
functionally equivalent to the disclosed sequences. Thus, as used herein 
functionally equivalent nucleic acid molecules or proteins are those that 
are sufficiently similar to function in substantially the same manner to 
25 achieve substantially the same results. 

As used herein, "slight and non-consequential sequence variations" 
mean that sequences that are substantially the same as the DNA, RNA, 
or proteins disclosed and claimed herein are functionally equivalent to the 
human-derived sequences disclosed and claimed herein. Functionally 
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equivalent sequences will function in substantially the same manner to 
produce substantially the same compositions as the human-derived 
nucleic acid and amino acid compositions disclosed and claimed herein. 
In particular, functionally equivalent DNA molecules encode human- 
5 derived proteins that are the same as those disclosed herein or that have 
conservative amino acid variations, such as substitution of a non-polar 
residue for another non-polar residue or a charged residue for a similarly 
charged residue (see, e.g. . Table 1). These changes include those 
recognized by those of skill in the art as those that do not substantially 

10 alter the tertiary structure of the protein. 

Suitable conservative substitutions of amino acids are known to 
those of skill in this art and may be made generally without altering the 
biological activity of the resulting molecule. Those of skill in this art 
recognize that, in general, single amino acid substitutions in non-essential 

15 regions of a polypeptide do not substantially alter biological activity (see, 
e.g. . Watson ej aL Molecular Biology of the Gene, 4th Edition, 1 987, The 
Benjamin/Cummings Pub. co., p.224). Such substitutions are preferably 
made in accordance with those set forth in TABLE 1 as follows: 

20 TABLE 1 



Original residue 



Conservative substitution 



30 



25 



35 



Ala (A) 
Arg (R) 
Asn (N) 
Cys (C) 
Gin (Q) 
Glu (E) 
Qly (G) 
His (H) 
lie (U 
Leu (L) 
Lys (Kl 
Met (M) 
Phe (F) 
Ser (S) 
Thr (T) 
Trp (W) 



Ser; neutral amino acids 



lie; Val 
Arg; GIny; Glu 
Leu; Tyr; lie 
Met; Leu; Tyr 



Asn 

Asp 
Ala; Pro 
Asn; Gin 
Leu; Val 



Gly; Ser 
Lys 

Gin; His 



Thr 
Ser 
Tyr 



wo 96/41876 



PCT/US96/09775 



-17- 



Original residue 
Tyr (Y) 

Val (V) 



Conservative substitution 
Trp; Phe 

lie; Leu 



10 



15 



20 



Other substitutions are also permissible and may be determined 
empirically or in accord with known conservative substitutions. Any such 
modification of the polypeptide may be effected by any means known to 
those of skill in this art. 

As used herein, activity of a human neuronal nAChR refers to any 
activity characteristic of an nAChR. Such activity can typically be 
measured by one or more in vitro methods, and frequently corresponds to 
an in vivo activity of a human neuronal nAChR. Such activity may be 
measured by any method known to those of skill in the art, such as, for 
example, measuring the amount of current which flows through the 
recombinant channel in response to a stimulus. 

Methods to determine the presence and/or activity of human 
neuronal nAChRs include, but are not limited to, assays that measure 
nicotine binding, ®®Rb ion-flux, Ca^* influx, the electrophysiological 
response of cells, the electrophysiological response of oocytes injected 
with RNA. In particular, methods are provided herein for the 
measurement or detection of an nAChR-mediated response upon contact 
of cells containing the DNA or mRNA with a test compound. 

As used herein, a recombinant or heterologous human neuronal 
nAChR refers to a receptor that contains one or more subunits encoded 
by heterologous DNA that has been introduced into and expressed in cells 
capable of expressing receptor protein. A recombinant human neuronal 
nAChR may also include subunits that are produced by DNA endogenous 
to the host cell. In certain embodiments, recombinant or heterologous 
human neuronal nAChR may contain only subunits that are encoded by 
heterologous DNA. 
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As used herein, a functional neuronal nAChR is a receptor that 
exhibits an activity of neuronal nicotinic nAChRs as assessed by any in 
vitro ox in vivo assay disclosed herein or known to those of skill in the 
art. Possession of any such activity that may be assessed by any 
5 methods known to those of skill in the art and provided herein is 

sufficient to designate a receptor as functional/ Methods for detecting 
nAChR protein and/or activity include, but are not limited to, for example, 
assays that measure nicotine binding, ^^Rb ion-flux, Ca^* influx and the 
electrophysiological response of cells containing heterologous DNA or 

10 mRNA encoding one or more receptor subunit subtypes. Since all 
combinations of alpha and beta subunits may not form functional 
receptors, numerous combinations of alpha and beta subunits may be 
tested in order to fully characterize a particular subunit and cells which 
produce same. Thus, as used herein, "functional" with respect to a 

15 recombinant or heterologous human neuronal nAChR means that the 
receptor channel Is able to provide for and regulate entry of human 
neuronal nAChR-permeable ions, such as, for example, Na*, K+, Ca^* or 
Ba^*, in response to a stimulus and/or bind ligands with affinity for the 
receptor. Preferably such human neuronal nAChR activity is 

20 distinguishable, such as by electrophysiological, pharmacological and 
other means known to those of skill in the art, from any endogenous 
nAChR activity that may be produced by the host cell. 

As used herein, one type of a "control" cell or "control" culture is a 
cell or culture that is treated substantially the same as the cell or culture 

25 exposed to the test compound except that the control culture is not 
exposed to the test compound. Another type of a "control" cell or 
"control" culture may be a cell or a culture of cells which are identical to 
the transfected cells except the cells employed for the control culture do 
not express functional nicotinic acetylcholine receptors. In this situation, 
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the response of test cell to the test compound is compared to the 
response (or lack of response) of the nicotinic acetylcholine 
receptor-negative cell to the test compound, when cells or cultures of 
each type of cell are exposed to substantially the same reaction 
5 conditions in the presence of the compound being assayed. 

As used herein, a compound or signal that "modulates the activity 
of a neuronal nAChR" refers to a compound or signal that alters the 
activity of nAChR so that activity of the nAChR is different in the 
presence of the compound or signal than in the absence of the compound 

10 or signal. In particular, such compounds or signals include agonists and 
antagonists. The term agonist refers to a substance or signal, such as 
ACh, that activates receptor function; and the term antagonist refers to a 
substance that interferes with receptor function. Typically, the effect of 
an antagonist is observed as a blocking of activation by an agonist. 

15 Antagonists include competitive and non-competitive antagonists, a 

competitive antagonist (or competitive blocker) Interacts with or near the 
site specific for the agonist ( e.c . ligand or neurotransmitter) for the same 
or closely situated site. A non-competitive antagonist or blocker 
inactivates the functioning of the receptor by interacting with a site other 

20 than the site that interacts with the agonist. 
A. Isolated DNA clones 

DNA molecules encoding human alpha and beta subunits of 
neuronal nAChRs are provided. Specifically, isolated DNAs encoding Oq 
and p2 subunits of human neuronal nAChRs are described herein. 

25 Recombinant messenger RNA (mRNA) and recombinant polypeptides 
encoded by the above-described DNA are also provided. 

For purposes herein, "a^ subunit-encoding nucleic acid " refers to 
DNA or RNA encoding a neuronal nicotinic acetylcholine receptor subunit 
of the same name. Such nucleic acid molecules can be characterized in a 
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number of ways, for example the nucleotides of the DNA (or 
ribonucleotides of the RNA) may encode the amino acid sequence set 
forth in SEQ ID NO: 10 or SEQ ID NO: 20. 

Presently preferred ag-encoding nucleic acid includes DNA or RNA 
5 that hybridizes to the coding sequence set forth in SEQ ID N0:9 
(preferably to substantially the entire coding sequence thereof, i.e. . 
nucleotides 143-1624) or SEQ ID NO: 19 (preferably to substantially the 
entire coding sequence thereof, i.e,, nucleotides 143-1579) under high 
stringency conditions. 

10 Especially preferred ag-encoding nucleic acid molecules are those 

that encode a protein having substantially the same amino acid sequence 
(Le., with only conservative amino acid substitutions) as that set forth in 
SEQ ID NO:iO or SEQ ID N0:20. Most preferred molecules include a 
sequence of nucleotides (or ribonucleotides with U substituted for T) 

15 having substantially the same sequence of nucleotides as set forth in SEQ 
ID NO: 9 (i.e. . particularly nucleotides 143-1624 thereof) or SEQ ID 
NO: 19 (i.e., particularly nucleotides 143-1579 thereof). 

Typically, unless an subunit arises as a splice variant, a^- 
encoding DNA will share substantial sequence homology (Le. greater than 

20 about 90%), with a Grg-encoding nucleic acid molecules described herein. 
DNA or RNA encoding a splice variant may share less than 90% overall 
sequence homology with the DNA or RNA provided herein, but such a 
splice variant would include regions of nearly 100% homology to one or 
more of the nucleic acid molecules provided herein. 

25 Also provided herein are "ySg subunit-encoding nucleic acids", which 

include DNA or RNA molecules that encode a neuronal nicotinic 
acetylcholine receptor subunit of the same name. Such nucleic acid 
molecules can be characterized in a number of ways, for example, the 
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nucleotides of the DNA (or ribonucleotides of the RNA) may encode the 
amino acid sequence set forth in SEQ ID NO: 16. 

Presently preferred ^3-encoding nucleic acid includes DNA or RNA 
that hybridizes under high stringency conditions to the coding sequence 
5 set forth in SEQ ID NO: 15 (preferably to substantially the entire coding 
sequence thereof, Le^, nucleotides 98-1471 ). More preferred are those 
nucleic acids that encode a protein that includes the sequence of amino 
acids (or substantially the sequence of amino acids with only 
conservative amino acid substitutions) set forth in SEQ ID NO: 16. 

10 Especially preferred ^Ss-encoding nucleic acid molecules provided herein 
have substantially the same nucleotide sequence as set forth in SEQ ID 
N0:15 (i.e., particularly nucleotides 98*1471 thereof). 

Typically, unless a subunit arises as a splice variant, iffs-encoding 
nucleic acid will share substantial sequence homology (greater than about 

15 90%) with the i^s-encoding nucleic acid molecules described herein. DNA 
or RNA encoding a splice variant may share less than 90% overall 
sequence homology with the DNA or RNA provided herein, but such 
nucleic would include regions of nearly 1 00% homology to one or more 
of the above-described nucleic acid molecules. 

20 B. Probes 

DNA encoding human neuronal nicotinic nAChR alpha and beta 
subunits may be isolated by screening suitable human cDNA or human 
genomic libraries under suitable hybridization conditions with the DNA 
disclosed herein (including nucleotides derived from SEQ ID N0s:9 or 15). 

25 Suitable libraries can be prepared from tissues such as neuronal tissue 
samples, basal ganglia, thalamus, and hypothalamus tissues. The library 
is preferably screened with a portion of DNA including the entire subunit- 
encoding sequence thereof, or the library may be screened with a suitable 
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probe. Typically probes are labeled with an identifiable tag, such as a 
radiolabeL enzyme or other such tag known to those of skill in the art. 

Probes for use In methods of isolating a^- and iffg-encoding nucleic 
acids are also provided. Thus, for example, with reference to human 
5 subunits, a probe is a single-stranded DNA or RNA molecule that has a 
sequence of nucleotides that includes at at least 27 contiguous bases 
that are the same as (or the complement of) any 27 bases set forth in 
SEQ ID N0:9 or SEQ ID N0:19. 

With reference to human fi^ subunits, a probe is single-stranded 

10 DNA or RNA that has a sequence of nucleotides that includes at least 28 
contiguous bases that are the same as (or the complement of) any 28 
bases derived from the first 105 nucleotides of signal sequence/coding 
sequence set forth in SEQ ID NO: 15. 

Among the preferred regions from which to construct probes 

15 include, but are not limited to, 5' and/or 3' coding sequences, regions 
containing sequences predicted to encode transmembrane domains, 
regions containing sequences predicted to encode a cytoplasmic loop, 
signal sequences, and acetylcholine (ACh) and a-bungarotoxin (a-bgtx) 
binding sites. Amino acids that correspond to residues 1 90-1 98 of the 

20 Torpedo nAChR a subunit (see, e^, Karlin (1 993) Curr. Ooin. Neurobiol. 

3:299-309) are typically involved in ACh and a-bgtx binding. The 

approximate amino acid residues which include such regions for other 

probes are set forth in the following table. Table 2: 

Subunit Signal TIVID1' TMD2 TMD3 TMD4 Cytoplasmic 

Sequence loop 

25 ffe' 1-30 240-265 272-294 301-326 458-483 327-457 

iffg 1-20 231-258 265-287 293-318 421-446 319-420 

* TMD = transmembrane domain 

' With reference to the amino acid sequence shown in SEQ ID NO: 10. 
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Alternatively, portions of the DNA can be used as primers to amplify 
selected fragments in a particular library. 

C. Isolation of clones encoding and fi^ subunits of human neuronal 
5 nicotinic acetylcholine receptors 

The probes are used to screen a suitable library. Suitable libraries 

for obtaining DNA encoding each subunit include, but are not limited to: 

substantia nigra, thalamus or hypothalamus to isolate human ae-encoding 

DNA and substantia nigra or thalamus to isolate human ^ffs-encoding DNA. 

10 After screening the library, positive clones are identified by 

detecting a hybridization signal; the identified clones are characterized by 
restriction enzyme mapping and/or DNA sequence analysis, and then 
examined, by comparison with the sequences set forth herein, to 
ascertain whether they include DNA encoding a complete alpha or beta 

15 subunit. If the selected clones are incomplete, the may be used to 

rescreen the same or a different library to obtain overlapping clones. If 
desired, the library can be rescreened with positive clones until 
overlapping clones that encode an entire alpha or beta subunit are 
obtained. If the library is a cDNA library, then the overlapping clones will 

20 include an open reading frame. If the library is genomic, then the 

overlapping clones may include exons and introns. Complete clones may 
be identified by comparison with the DNA and encoded proteins provided 
herein. 

Complementary DNA clones encoding various subtypes of human 
25 neuronal nAChR alpha and beta subunits have been isolated. Each 

subtype of the subunit appears to be encoded by a different gene. The 
DNA clones provided herein may be used to isolate genomic clones 
encoding each subtype and to isolate any splice variants by screening 
libraries prepared from different neural tissues. Nucleic acid amplification 
30 techniques, which are well known in the art, can be used to locate splice 
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variants of human neuronal nAChR subunits. This is accomplished by 
employing oligonucleotides based on DNA sequences surrounding 
divergent sequence{s) as primers for amplifying human RNA or genomic 
DNA. Size and sequence determinations of the amplification products 
5 can reveal the existence of splice variants. Furthermore, isolation of 
human genomic DNA sequences by hybridization can yield DNA 
containing multiple exons, separated by introns, that correspond to 
different splice variants of transcripts encoding human neuronal nAChR 
subunits. 

10 It has been found that not all subunit subtypes are expressed in all 

neural tissues or in all portions of the brain. Thus, In order to isolate 
cDNA encoding particular subunit subtypes or splice variants of such 
subtypes, it is preferable to screen libraries prepared from different 
neuronal or neural tissues. 

15 D. Cells and vectors containing a^- and i?3«encodlng nucleic acids 
The above-described nucleic acid molecules encoding human 
nAChR subunits can be incorporated into vectors for further manipulation. 
Incorporation of cloned DNA into a suitable expression vector, 
transfection of eukaryotic cells with one or a combination of expression 

20 constructs encoding one or more distinct genes or with linear DNA, and 
selection of transfected cells are well known in the art (see, e.g. . 
Sambrook et aL (1 989) Molecular Clonino: A Laboratory Manual. 2nd 
Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY). 
Heterologous DNA may be introduced into host cells by any method 

25 known to those of skill in the art, such as transfection with an expression 
construct encoding the heterologous DNA by CaP04 precipitation (see, 
e^, Wigler et aL (1979) Proc. Natl. Acad. ScL U.S.A. 76: 1373-1376). 
Recombinant cells can then be cultured under conditions whereby the 
. subunit(s) encoded by the DNA is (are) expressed. Preferred cells 
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include, but are not limited to, mammalian cells ( e.g. , HEK 293, CHO and 
Ltk cells), yeast cells (e^, methylotrophic yeast cells, such as Pichia 
pastoris) and bacterial cells ( e.g. . Escherichia coii). 

The nucleic acids encoding Cq or fi^ subunits can be incorporated 
5 into vectors individually or in combination with nucleic acids encoding 
other nicotinic acetylcholine receptor subunits for further manipulation. 
Full-length DNA clones encoding human neuronal nAChR subunits have 
been inserted into vector pcDNA3, a pUC19-based mammalian cell 
expression vector containing the CMV promoter/enhancer, a polylinker 

10 downstream of the CMV promoter/enchancer, followed by the bovine 
growth hormone (BGH) polyadenylation signal. Placement of nAChR 
subunit-encoding DNA between the CMV promoter and BGH 
polyadenylation signal provides for constitutive expression of the DNA in 
a mammalian host cell transfected with the construct. For inducible 

15 expression of human nAChR subunit-encoding DNA in a mammalian cell, 
the DNA can be inserted into a plasmid such as pMAMneo. This plasmid 
contains the mouse mammary tumor virus (MMTV) promoter for steroid- 
inducibie expression of operatively associated foreign DNA. If the host 
cell does not express endogenous glucocorticoid receptors required for 

20 uptake of glucocorticoids ( i.e. , inducers of the MMTV promoter) into the 
cell, it is necessary to additionally transfect the cell with DNA encoding 
the glucocorticoid receptor (ATCC accession no. 67200). 

In accordance with another embodiment, there are provided cells 
containing the above-described polynucleic acids ( i.e. , DNA or mRNA). 

25 Host cells such as bacterial, yeast and mammalian cells can be used for 
replicating DNA and producing nAChR subunit(s). Methods for 
constructing expression vectors, preparing in vitro transcripts, 
transfecting DNA into mammalian cells, injecting oocytes, and performing 
electrophysiological and other analyses for assessing receptor expression 
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and function as described herein are also described in PCT Application 
Nos. PCT/US91/02311, PCT/US94/02447, PCT/US91/05625, and 
PCT/US92/11090, in U.S. Patent No. 5,369,028, and in co-pending U.S. 
Application Serial Nos. 07/563,751 and 07/812,254. The subject matter 
5 of these applications is hereby incorporated by reference herein in its 
entirety. 

While the DNA provided herein may be expressed in any eukaryotic 
cell, including yeast cells (such as, for example, Pichia, particularly Pichia 
pastoris (see U.S. Patent Nos. 4,882,279, 4,837,148, 4,929,555 and 

10 4,855,231), Saccharomyces cerevisiae, Candida tropicalis, Hansenula 
polymorpha, and other yeast cells), mammalian expression systems, 
including commercially available systems and other such systems known 
to those of skill in the art, for expression of DNA encoding the human 
neuronal nicotinic nAChR subunits provided herein are presently 

15 preferred. Xenopus oocytes are preferred for expression of RNA 
transcripts of the DNA. 

Cloned full-length DNA encoding any of the subunits of human 
neuronal nicotinic nAChR may be Introduced into a plasmid vector for 
expression in a eukaryotic cell. Such DNA may be genomic DNA or 

20 cDNA. Host cells may be transfected with one or a combination of 
plasmids, each of which encodes at least one human neuronal nAChR 
subunit. Heterologous DNA may be maintained in the cell as an episomal 
element or may be integrated into chromosomal DNA of the cell. 

Eukaryotic cells in which DNA or RNA may be introduced include 

25 any cells that are transfectable by such DNA or RNA or into which such 
DNA or RNA may be injected. Preferred cells are those that can be 
transiently or stably transfected and also express the DNA and RNA. 
Presently most preferred cells are those that can form recombinant or 
heterologous human neuronal nicotinic nAChRs containing one or more 
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subunits encoded by the heterologous DNA. Such cells may be identified 
empirically or selected from among those known to be readily transfected 
or injected. 

Exemplary cells for introducing DNA include, but are not limited to, 
5 cells of mammalian origin ( e.g. , COS cells, mouse L cells, Chinese 
hamster ovary (CHO) cells, human embryonic kidney (HEK) cells, GH3 
cells and other such cells known to those of skill in the art, amphibian 
cells (e.g. . Xenopus laevis oocytes) and yeast cells ( e.g. , Saccharomyces 
cerevisiae, Pichia pastoris). Exemplary cells for expressing injected RNA 

10 transcripts include Xenopus laevis oocytes. Cells that are preferred for 
transfection of DNA are known to those of skill in the art or may be 
empirically identified, and include HEK 293 (which are available from 
ATCC under accession #CRL 1573); Ltk' cells (which are available from 
ATCC under accession #CCL1.3); COS-7 cells (which are available from 

15 ATCC under accession #CRL 1651); and GH3 cells (which are available 
from ATCC under accession #CCL82.1). Presently preferred cells include 
GH3 cells and HEK 293 cells, particularly HEK 293 cells that have been 
adapted for growth in suspension and that can be frozen in liquid nitrogen 
and then thawed and regrown. HEK 293 cells are described, for 

20 example, in U.S. Patent No. 5,024,939 to Gorman (see, also, Stillman et 
aL (19851 Mol. Cell. Biol. 5:2051-2060), * 

DNA can be stably incorporated into cells or may be transiently 
introduced using methods known in the art. Stably transfected 
mammalian cells may be prepared by transfecting cells either with one or 

25 more expression constructs carrying DNA encoding nAChR subunits and 
a separate expression vector carrying a selectable marker gene ( e.g. , but 
not limited to, the gene for neomycin resistance, zeocin resistance, or 
hygromycin resistance) or with one or more expression constructs which 
carry the DNA encoding nAChR subunit and the selectable marker, and 
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growing the transfected cells under conditions selective for cells 
expressing the marker gene(s). To produce such cells, the cells should be 
transfected with a sufficient concentration of subunit-encoding nucleic 
acids to form human neuronal nAChRs that contain the human subunits 
5 encoded by heterologous DNA. The precise amounts and ratios of DNA 
encoding the subunits may be empirically determined and optimized for a 
particular combination of subunits, cells and assay conditions. 
Recombinant cells that express neuronal nAChR containing subunits 
encoded only by the heterologous DNA or RNA are especially preferred. 

10 E. Recombinant nAChRs and nAChR Subunit Proteins 

Provided herein are substantially pure human nAChR subunit 
proteins, particularly human and fi^ subunit proteins. Also provided 
herein are recombinant nAChR containing at least one of the human 
nAChR subunit proteins. Thus, a further embodiment provided herein 

15 contains methods of producing recombinant human nAChR subunits and 
receptors containing the subunits. 

In preferred embodiments, DNA encoding human nAChR subunitfs), 
particularly human nAChR and/or ^3 subunits, is ligated into a vector, 
and the resulting construct is introduced into suitable host cells to 

20 produce transformed cell lines that express a specific human neuronal 
nAChR receptor subtype, or specific combinations of subtypes. The 
resulting cell lines can then be produced in quantity for reproducible 
quantitative analysis of the effects of drugs on receptor function. In 
other embodiments, mRNA may be produced by in vitro transcription of 

25 DNA encoding each subunit. This mRNA, either from a single subunit 
clone or from a combination of clones, can then be injected into Xenopus 
oocytes where the mRNA directs the synthesis of the human receptor 
subunits, which then form functional receptors. Alternatively, the 
subunit-encoding DNA can be directly injected into oocytes for expression 



wo 96/41876 



PCTAJS96/09775 



-29- 

of functional receptors. The transfected mammalian cells or injected 
oocytes may then be used in the methods of drug screening provided 
herein. 

The resulting recombinant cells may be cultured or subcultured (or 
5 passaged, in the case of mammalian cells) from such a culture or a 
subculture thereof. Methods for transfection, injection and culturing 
recombinant cells are known to the skilled artisan. Similarly, the human 
neuronal nicotinic nAChR subunits may be purified using protein 
purification methods known to those of skill in the art. For example, 

10 antibodies or other ligands that specifically bind to one or more of the 
subunits may be used for affinity purification of the subunit or human 
neuronal nAChRs containing the subunits. 

In accordance with one embodiment, methods for producing cells 
' that express human neuronal nAChR subunits and functional receptors 

15 are also provided. In one such method, host cells are transfected with 
DNA encoding at least one alpha subunit of a neuronal nAChR and at 
least one beta subunit of neuronal nAChR. Using methods such as 
northern blot or slot blot analysis, transfected cells that contain alpha 
and/or beta subunit encoding DNA or RNA can be selected. Transfected 

20 cells are also analyzed to identify those that express nAChR protein. 

Analysis can be carried out, for example, by measuring the ability of cells 
to bind acetylcholine, nicotine, or a nAChR agonist, compared to the 
nicotine binding ability of untransfected host cells or other suitable 
control cells, or by electrophysiologicaly monitoring the currents through 

25 the cell membrane in response to a nAChR agonist. 

In particularly preferred aspects, eukaryotic cells that contain 
heterologous DNA, express such DNA and form recombinant functional 
neuronal nAChR(s) are provided. In more preferred aspects, recombinant 
neuronal nAChR activity is readily detectable because it is a type that is 
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absent from the untransfected host cell or is of a magnitude not exhibited 
in the untransfected cell. Such cells that contain recombinant receptors 
could be prepared, for example, by causing cells transformed with DNA 
encoding the hunian neuronal nicotinic nAChR Oq and ^3 subunits to 
5 express the corresponding proteins in the presence or absence of one or 
more alpha and/or beta nAChR subunits. The resulting synthetic or 
recombinant receptor would contain the and 0^ nAChR subunits. Such 
a receptor would be useful for a variety of applications, e.g. . as part of an 
assay system free of the interferences frequently present in prior art 

10 assay systems employing non-human receptors or human tissue 

preparations. Furthermore, testing of single receptor subunits with a 
variety of potential agonists or antagonists would provide additional 
information with respect to the function and activity of the individual 
subunits. Such information may lead to the identification of compounds 

15 which are capable of very specific interaction with one or more of the 
receptor subunits. Such specificity may prove of great value in medical 
application. 

Thus, DNA encoding one or more human neuronal nAChR subunits 
may be introduced into suitable host cells (e.g. , eukaryotic or prokaryotic 

20 cells) for expression of individual subunits and functional nAChRs. 

Preferably combinations of alpha and beta subunits may be introduced 
into cells: such combinations include combinations of any one or more of 
02, cf^r a^, Oq and Oy with /?2, and/or ^4. Sequence information for 
each of these subunits is presented in the Sequence Listing provided 

25 herewith. Sequence information for is also presented in Proc. Natl. 
Acad. Sci. USA (1992) 89:1572-1576; sequence information for aj, a^, 

<^7' Pi and is also presented in PCT publication WO 94/20617, 
incorporated by reference herein. Presentiy preferred combinations of 
subunits include and/or with any one or more of atj, a^, a^, a^, P2 
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Pi^. It is recognized tliat some of the subunits may have ion transport 
function in the absence of additional subunits, while others require a 
combination of two or more subunits in order to display ion transport 
function. For example, the a-j subunit is functional in the absence of any 
5 added beta subunit. Furthermore, some of the subunits may not form 
functional nAChRs alone or in combination, but instead may modulate the 
properties of other nAChR subunit combinations. 

In certain embodiments, eukaryotic cells with heterologous human 
neuronal nAChRs are produced by introducing into the cells a first 

10 composition, which contains at least one RNA transcript that is translated 
in the cell into a subunit of a human neuronal nAChR. In preferred 
embodiments, the subunits that are translated include an alpha subunit of 
a human neuronal nAChR. More preferably, the composition that is 
introduced contains a RNA transcript which encodes an alpha subunit and 

15 also contains a RNA transcript which encodes a beta subunit of a human 
neuronal nAChR. RNA transcripts can be obtained from cells transfected 
with DNAs encoding human neuronal nAChR subunits or by in vitro 
transcription of subunit-encoding DNAs. Methods for in vitro 
transcription of cloned DNA and injection of the resulting mRNA into 

20 eukaryotic cells are well known in the art. Amphibian oocytes are 

particularly preferred for expression of in vitro transcripts of the human 
neuronal nAChR DNA clones. See e.q.> Dascal (1 989) CRC Grit. Rev. 
Biochem. 22:317-387, for a review of the use of Xenopus oocytes to 
study ion channels. 

25 Thus, a stepwise introduction into cells of DNA or RNA encoding 

one or more alpha subtypes, and one or more beta subtypes is possible. 
The resulting cells may be tested by the methods provided herein or 
known to those of skill in the art to detect functional nAChR activity. 
Such testing will allow the identification of combinations of alpha and 
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beta subunit subtypes that produce functional nAChRs, as well as 
individual subunits that produce functional nAChRs. 

Recombinant receptors on recombinant eukaryotic cell surfaces 
may contain one or more subunits encoded by the DNA or mRNA 
5 encoding human neuronal nAChR subunits, or may contain a mixture of 
subunits encoded by the host cell and subunits encoded by heterologous 
DNA or mRNA. Recombinant receptors may be homogeneous or may be 
a mixture of subtypes. Mixtures of DNA or mRNA encoding receptors 
from various species, such as rats and humans, may also be introduced 

10 into the cells. Thus, a cell may be prepared that expresses recombinant 
receptors containing only and 0^ subunits, or in combination with any 
other alpha and beta subunits provided herein. For example, either or 
both of the and fi^ subunits provided herein can be co-expressed with 
^3/ ^4* <^Bf ^7f fii and/or 0^ receptor subunits. As noted previously, 

15 some of the neuronal nAChR subunits may be capable of forming 

functional receptors in the absence of other subunits, thus co-expression 
is not always required to produce functional receptors. Moreover, some 
nAChR subunits may require co-expression with two or more nAChR 
subunits to participate in functional receptors. 

20 F. Assays 

i 

In accordance with one embodiment provided herein, recombinant 
human neuronal nAChR-expressing mammalian cells or oocytes can be 
contacted with a test compound, and the modulating effect(s) thereof 
can then be evaluated by comparing the nAChR-mediated response in the 
25 presence and absence of test compound, or by comparing the nAChR- 
mediated response of test cells, or control cells to the presence of the 
compound. 

As understood by those of skill in the art, assay methods for 
identifying compounds that modulate human neuronal nAChR activity 
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l e.a. . agonists and antagonists) generally require comparison to a control. 
As noted above, one type of a "control" cell or "control" culture is a cell 
or culture that is treated substantially the same as the cell or culture 
exposed to the test compound, except the control culture is not expose 
5 to test compound. For example, in methods that use voltage clamp . 
eletrophysiological procedures, the same ceil can be tested in the 
presence and absence of test compound, by merely changing the external 
solution bathing the cell. Another type of "control" cell or "control" 
culture may be a ceil or a culture of cells which are identical to the 

10 transfected cells, except the cells employed for the control culture do not 
express functional human neuronal nAChRs. In this situation, the 
response of test cell to test compound is compared to the response (or 
lack of response) of receptor-negative (control) cell to test compound, 
when cells or cultures of each type of cell are exposed to substantially 

15 the same reaction conditions in the presence of compound being assayed. 

Functional recombinant human neuronal nAChRs include at least an 
alpha subunit, or at least an alpha subunit and a beta subunit of a human 
neuronal nAChR. Eukaryotic ceils expressing these subunits have been 
prepared by injection of RNA transcripts and by transfection of DNA. 

20 Such cells have exhibited nAChR activity attributable to human neuronal 
nAChRs that contain one or more of the heterologous human neuronal 
nAChR subunits. 

With respect to measurement of the activity of functional 
heterologous human neuronal nAChRs, endogenous nAChR activity and, 

25 if desired, activity of nAChRs that contain a mixture of endogenous host 
cell subunits and heterologous subunits, should, if possible, be inhibited 
to a significant extent by chemical, pharmacological and 
electrophysiological means. 
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G. Antibodies 

Also provided herein are antibodies generated against the above- 
desribed nAChR subunits or portions thereof. Such antibodies may be 
employed for assessing receptor tissue localization, subtype composition, 
5 structure of functional domains, purification of receptors, as well as in 
diagnostic and therapeutic applications. Preferably for therapeutic 
applications, the antibodies employed will be monoclonal antibodies. 

The above-described antibodies can be prepared employing 
standard techniques, as are well known to those of skill in the art, using 

10 the nAChR subunit proteins, or portions thereof, described herein as 
antigens for antibody production. Both anti-peptide and anti-fusion 
protein antibodies can be used [see, for example, Bahouth et aL (1991) 
Trends Pharmacol. Sci. 12:338-343: Current Protocols in Molecular 
Bioloav (Ausubel et gL, eds.), John Wiley and Sons, New York (1989)]. 

15 Factors to consider in selecting portions of the nAChR subunits for use as 
immunogen (as either a synthetic peptide or a recombinantly produced 
bacterial fusion protein) include antigenicity, accessibility (i.e., 
extracellular and cytoplasmic domains), uniqueness to the particular 
subtype, and other factors known to those of skill in this art. 

20 The availability of subtype-specific antibodies makes possible the 

application of the technique of immunochemistry to monitor the 
distribution and expression density of various subtypes (e.g., in normal 
vs. diseased brain tissue). The antibodies produced using the human 
nAChR subunits as immunogens have, among other properties, the ability 

25 to specifically and preferentially bind to and/or cause the 

immunoprecipitation of human nAChR or a subunit thereof which may be 
present in a biological sample or a solution derived from such a sample. 
Such antibodies may also be used to selectively isolate cells that express 
human nAChR that contain the subunit for which the antibodies are 
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specific. Such antibodies could also be employed for diagnostic and 
therapeutic applications. In a further embodiment, there are provided 
methods for modulating the ion channel activity of nAChRs by contacting 
the receptors with an effective amount of the above-described antibodies. 
5 The antibodies herein can be administered to a subject employing 

standard methods, such as, for example, by intraperitoneal, 
intramuscular, intravenous, or subcutaneous injection, implant or 
transdermal modes of administration. One of skill in the art can readily 
determine dose forms, treatment regiments, etc., depending on the mode 
10 of administration employed. 

The following examples are included for illustrative purposes only 
and are not intended to limit the scope of the invention. 

EXAMPLE 1 

Isolation of DNA Encoding Human nAChR Subunits 
15 A human substantia nigra cDNA library (Clontech Laboratories, 

Inc.) was screened for hybridization to a fragment of the rat nAChR 
subunit cDNA. Isolated plaques were transferred to nitrocellulose filters 
and hybridization was performed in 5X Denhardt's, 5X SSPE, 50% 
formamide, 200 //g/ml denatured salmon sperm DNA and 0.2% SDS, at 
20 42*C. Washes were performed in 0.2X SSPE, 0.2% SDS, at 60°C. 

Five hybridizing clones were plaque-purified and characterized by 
restriction endonuclease mapping and DNA sequence analysis. 
The DNA sequence of the 5'- and 3' -ends of the cDNA inserts was 
determined using commercially available ^gt^O forward and reverse 
25 oligonucleotide primers. Analysis of the DNA sequence of the five cDNA 
inserts revealed that three clones contained the translational initiation 
codon, a full-length Oq open reading frame (nucleotides 143-1624 of SEQ 
ID N0:9), the translational stop codon and 142 additional nucleotides of 
5'- and 116 nucleotides of 3'- flanking sequences. The amino acid 
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sequence deduced from the nucleotide sequence of the full-length clone 
has -82% identity with the amino acid sequence deduced from the rat 
nAChR Oq subunit DNA. Several regions of the deduced rat and human 
amino acid sequences are notably dissimilar: 
5 amino acids 1-30 (the human signal sequence has only --56% 

identity with respect to the rat sequence), 

amino acids 31-50 (the human sequence has only —70% identity 
with respect to the rat sequence), 

amino acids 344-391 (the human sequence has only —40% 
identity with respect to the rat sequence), 

amino acids 401-428 (the human sequence has only -64% 
identity with respect to the rat sequence). 

Furthermore, the insert DNA of a single clone, KEa6.5, was 
determined to be missing 45 nucleotides of coding sequence, resulting 
in an in-frame deletion of 1 5 amino acid residues of the deduced amino 
acid sequence (residues 74 to 88 of SEQ ID NO: 10). The nucleotide 
sequence of an subunit variant lacking this sequence is shown in SEQ 
ID NO: 19 and the amino acid sequence deduced therefrom is shown in 
SEQ ID N0:20. Interestingly, the deduced amino acid sequence 
immediately downstream of the site of the deletion shares only - 58% 
amino acid identity with the deduced rat amino acid sequence (amino 
acids 89^100 of SEQ ID N0:10). 

EXAMPLE 2 

isolation of DNA Encoding A Human nAChR fi^ Subunit 

A human substantia nigra cDNA library (Clontech Laboratories, 
Inc.) was screened for hybridization to synthetic oligonucleotides 
complementary to the human nicotinic nAChR subunit cDNA. Isolated 
plaques were transferred to nitrocellulose filters and hybridized under high 
stringency conditions with respect to oligojiucleotides (washing 
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conditions IX SSPE, 0.2% SDS at 50"C) with synthetic oligonucleotides 
complementary to sequences of the human jffg nAChR subunit cDNA that 
include nucleotides 212-230 and 1442-1469 of SEQ ID NO: 15. 

Two hybridizing clones were plaque-purified and characterized by 
5 restriction endonuciease mapping. The DNA sequence of the 5'- and 3'- 
ends of the cDNA insert was determined using commercially available T7 
and SP6 oligonucleotide primers. The complete sequence of clone 
KBye3.2 was determined. Clone KB^3.2 contains a 1927 bp cDNA insert 
that contains a 1 ,377-nucleotide open reading frame encoding a full- 
ID length ^3 nAChR subunit (nucleotides 98-1471 SEQ ID N0:1 5) as well as 
97 nucleotides of 5'- and 454 nucleotides of 3'-untranslated sequence. 
The amino acid sequence deduced from the nucleotide sequence of the 
full-length clone has -81% identity with the amino acid sequence 
deduced from the rat nicotinic nAChR subunit DNA. Several regions of 
15 the deduced rat and human fi^ amino acid sequences are notably 
dissimilar: 

amino acids 1-28 (the human signal sequence has only -25% 
identity with respect to the rat sequence), 

amino acids 347-393 (the human sequence has only -55% 
20 identity with respect to the rat sequence), 

amino acids 440-464 (the human sequence has only -68% 
identity with respect to the rat sequence). 

EXAMPLE 3 

Preparation of Constructs for the Expression of Recombinant Human 
25 Neuronal nAChR Subunits 

Isolated cDNAs encoding human neuronal nAChR subunits were 

incorporated into vectors for use in expressing the subunits in mammalian 

host cells and for use in generating in vitro transcripts from the DNAs to 
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be expressed in Xenopus oocytes. The following vectors were utilized in 
preparing the constructs. 

A. Constructs for Expressing Human nAChR Subunits 

A 1,743 bp EcoR\ fragment encoding a full-length nAChR 
5 subunit, was isolated from KEa6.3 by standard methods and ligated into 
the £coRI polylinker site of the vector pcDNAS to generate pcDNAS- 
KEa6.3 (see Figure 1). Plasmid pcDNA3 (see Figure 1) is a pUC19-based 
vector that contains a CMV promoter/enhancer, a 17 bacteriophage RNA 
polymerase promoter positioned downstream of the CMV 

10 promoter/enhancer, a bovine growth hormone (BGH) polyadenylation 
signal downstream of the 17 promoter, and a polylinker between the 17 
promoter and the BGH polyadenylation signal. This vector thus contains 
all of the regulatory elements required for expression in a mammalian host 
cell of heterologous DNA which has been incorporated into the vector at 

15 the polylinker. In addition, because the 17 promoter is located just 

upstream of the polylinker, this plasmid can be used for the synthesis of 
in vitro transcripts of heterologous DNA that has been subcloned into the 
vector at the polylinker. Furthermore, this plasmid contains a gene 
encoding neomycin resistance used as a selectable marker during 

20 transf action. 

Figure 1 also shows a partial restriction map of pcDNA3-KEa6.3. 

The expression of the full-length human nAChR subunit was 
optimized by the introduction of a consensus ribosome binding site [RBS; 
see, e.g., Kozak (1991) J. Biol. Chem. 266 :19867-19870] prior to the 

25 translational start codon. The existing 5'-untranslated region was 

modified by PCR mutagenesis using the plasmid pcDNA3-KEa6.3 as a 
DNA template and a complementary upstream oligonucleotide containing 
the appropriate nucleotide RBS substitutions as well as flanking 5' HindlW 
and EcoRl sites, and an oligonucleotide cotnplementary to coding 
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sequences -450 nucleotides downstream of the translational start 
codon. The resulting amplification product contained Hind\l\ and EcoRI 
sites followed by the consensus RBS and nucleotides 1-459 of the human 
nAChR Oq coding sequence (nucleotides 143-602 of SEQ ID N0:9). The 
5 amplified DNA was digested with HindlW and BamH\; the 308-bp Hind\\\- 
BamH\ fragment was isolated and ligated with the 5.3 kb BamH\'Pvu\ 
fragment of pcDNA3-KEa6.3 and the 1 ,4-kb Pvu\ to Hind\\\ fragment from 
pcDNAS to generate the -7.0 kb plasmid pcDNA3-KEa6RBS. 
B. Constructs for Expressing Human Neuronal nAChR Subunits 
10 An -2.0 kb fcoRI fragment, encoding a full-length nicotinic AChR 

subunit, was Isolated from KB)93.2 by standard methods and ligated 
Into the £coRI polylinker site of the vector pcDNAS to generate pcDNAS- 
KBj93.2 (see Figure 2). Figure 2 also shows a partial restriction map of 
pcDNA3.KBi93.2. 

15 The expression of the full-length human nicotinic nAChR ^3 subunit 

is optimized by the introduction of a consensus ribosome binding site 
(RBS) prior to the translational start codon. The existing 5'-untranslated 
region is modified by PGR mutagenesis using a method similar to that 
described above for the nAChR subunit to generate pcDNA3-KB/?3RBS. 

20 EXAMPLE 4 

Expression of Recombinant Human Neuronal nAChR in Xenopus 

Xenopus oocytes are injected with in vitro transcripts prepared 
from constructs containing DNA encoding and P^ subunits. 
Electrophysiological measurements of the oocyte transmembrane currents 

25 are made using the two-electrode voltage clamp technique (see, e.o. . 
Stuhmer (1992) J\/leth. Enzymol. 207:310-339). 



V 
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1 . Preparation of in vitro transcripts 

Recombinant capped transcripts of pcDNAS-KEaRBS and pcDNAS- 
KBjffSRBS are syntiiesized from linearized plasmids using the mMessage 
and mlVlachine in vitro transcription Icit according to the capped transcript 
5 protocol provided by the manufacturer {Catalog 1344 from AMBION, Inc., 
Austin, TX). The mass of the synthesized transcripts is determined by 
UV absorbance and the integrity of the transcripts is determined by 
electrophoresis through an agarose gel. 

2. Electrophysiology 

10 Xenopus oocytes are injected with either 12.5, 50 or 125 ng of 

one or more human nicotinic nAChR a and fi subunit transcript per 
oocyte. The preparation and injection of oocytes is carried out as 
described by Dascal (1987) in Crit Rev. Biocitem. 22:317-387. Two-to- 
six days following mRNA injection, the oocytes are examined using the 

15 two-electrode voltage clamp technique. The cells are bathed in Ringer's 
solution (115 mM NaCI, 2.5 mM KCI, 1.8 mM CaCI^, 10 mM HEPES, pH 
7.3) containing 1 //M atropine with or without 100 //M d-tubocurarine. 
Cells are voltage-clamped at -60 to -80 mV. Data are acquired with 
Axotape soft\A^are at 2-5 Hz. The agonists acetylcholine (ACh), nicotine, 

20 and cytisine are added at concentrations ranging from 0.1 //M to 100 //M. 

EXAMPLE 5 

Recombinant Expression of Human nAChR Subunits in Mammalian Cells 

Human embryonic kidney (HEK) 293 cells are transiently and stably 
transfected with DNA encoding human neuronal nicotinic nAChR and 
25 P2 subunits. Transient transfectants are analyzed for expression of 

nicotinic nAChR using various assays, e.g. . electrophysiological methods, 
Ca^'^-sensitive fluorescent indicator-based assays. 
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1. Transient Transfection of HEK Cells 

HEK cells are transiently co-transfected with DNA encoding one or 
more a subunit and/or one or more P subunits. Approximately 2X10® 
HEK cells are transiently transfected with 18 //g of the indicated 
5 plasmid(s) according to standard CaP04 transfection procedures (Wigler 
et al. (1979) Proc. Natl. Acad, Sci. U.S.A. 76!l373-ia7fi^ or using 
lipofectamine according to the manufacturer's instructions (Bethesda 
Research Laboratory (BRL), Gaithersburg, MD). In addition, 2 jc/g of 
plasmid pCMV^ffgal (Clontech Laboratories, Palo Alto, CA), which contains 

10 the Escherichia co// iff-galactosidase gene fused to the CMV promoter, are 
co-transfected as a reporter gene for monitoring the efficiency of 
transfection. The transfectants are analyzed for i9-galactosidase 
expression by measurement of i?-galactosidase activity IMiller (1972) 
Experiment s in Molecular Genetics , pp. 352-355, Cold Spring Harbor 

15 Press]. Transfectants can also be analyzed for j9-galactosidase expression 
by direct staining of the product of a reaction involving )9-galactosidase 
and the X-gal substrate [Jones (1986) EI\/IBO 5:3133-3142]. 
2. Stable Transfection of HEK Cells 
HEK cells are transfected using the calcium phosphate transfection 

20 procedure [Current Protocols in Molecular Biology, Vol. 1, Wiley Inter- 
Science, Supplement 14, Unit 9.1.1-9.1.9 (1990)]. HEK cells are 
transfected with 1 ml of DNA/calcium phosphate precipitate containing 
the DNA encoding the desired alpha and beta subunits and pSV2neo (as a 
selectable marker). After 14 days of growth in medium containing 

25 1 //g/ml G418, colonies form and are individually isolated by using cloning 
cylinders. The isolates are subjected to limiting dilution and screened to 
identify those that expressed the highest level of nAChR, as described 
below. 
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EXAMPLE 6 

Characterization of Cell Lines Expressing Human Neuronal 

nAChRs 

Recombinant cell lines generated by transfection with DNA 
5 encoding human neuronal nAChR subunits, such as those described in 
EXAMPLE 5, can be further characterized using one or more of the 
following methods. 

A. Northern or slot blot analysis for expression of a- and/or 
^-subunit encoding messages 

10 Total RNA is isolated from -1X10^ cells and 10-15 //g of RNA 

from each cell type is used for Northern or slot blot hybridization analysis. 
The inserts from human neuronal nAChR-encoding plasmids can be nick- 
translated and used as probe. In addition, a fragment of the 
glyceraldehyde-3-phosphate dehyrodgenase (GAPD) gene sequence (Tso 

15 et aL (1 985) Nucleic Acids Res. 13:2485) can be nick-translated and 
used as a control probe on duplicate filters to confirm the presence or 
absence of RNA on each blot and to provide a rough standard for use in 
quantitating differences in a- or )?- specific mRNA levels between cell 
lines. Typicar Northern and slot blot hybridization and wash conditions 

20 are as follows: 

hybridization in 5x SSPE, 5X Denhardt's solution, 0.2% SDS, 200 //g/ml 
denatured, sonicated herring sperm DNA, 50% formamide, at 42*^0 
followed by washing in 0.1 x SSPE, 0.1% SDS, at 65°C. 

B. Binding assay 

25 Cell lines generated by transfection with human neuronal nAChR a* 

or a- and )9-subunit-encoding DNA can be analyzed for their ability to bind 
nicotine or other agonist, for example, as compared to control cell lines: 
e.g., neuronally-derived cell lines PCI 2 (Boulter ej ah (1986) Nature 
319:368-374: ATCC #CRL1721) and IMR32 (Clementi, el aL (1986) InL 

30 J, Neurochem. 47:291-297: ATCC #CCL127), and muscle-derived cell 
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line BC3H1 (Patrick, et al. (1977) J. Biol, Chem. 252 :2143-2153). 
Negative control cells (Le., host cells from which the transfectants were 
prepared) are also included in the assay. The assay is conducted as 
follows: 

5 Just prior to being assayed, transfected cells are removed from plates by 
scraping. Positive control cells used are PCI 2, BC3H1, and IMR32 
(which had been starved for fresh media for seven days). Control cell 
lines are removed by rinsing in 37'C assay buffer (50mM Tris/HCI, 1 mM 
MgCl2, 2 mM CaCljr 120 mM NaCI, 3 mM EDTA, 2 mg/ml BSA and 0.1% 

10 aprotinin at pH 7.4). The cells are washed and resuspended to a 

concentration of 1 x 10V250 //I. To each plastic assay tube is added 
250 fA of the cell solution, 1 5 nM ^H-nicotine, with or without 1 mM 
unlabeled nicotine, and assay buffer to make a final volume of 500 //I. 
The assays for the transfected ceil lines are incubated for 30 min at room 

15 temperature; the assays of the positive control cells are incubated for 2 
min at 1 ''C. After the appropriate incubation time, 450 //I aliquots of 
assay volume are filtered through Whatman GF/C glass fiber filters which 
have been pretreated by incubation in 0.05% polyethylenenimine for 24 
hours at 4°C. The filters are then washed twice, with 4 ml each wash, 

20 with ice cold assay buffer. After washing, the filters are dried, added to 
vials containing 5 ml scintillation fluid and radioactivity is measured. 
C. ®®Rb ion-flux assay 

The ability of nicotine or nAChR agonists and antagonists to 
mediate the influx of ®®Rb into transfected and control cells has been 
25 found to provide an indication of the presence of functional nAChRs on 
the cell surface. The ®®Rb ion-flux assay is conducted as follows: 
1 . The night before the experiment, cells are plated at 2 x 10^ per well 
( i.e. , 2 ml per well) in a 6-well polylysine-coated plate. 
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2. The culture medium is decanted and the plate washed with 2 ml of 
assay buffer (50 mM HEPES, 260 mM sucrose, 5.4 mM KCI, 1.8 mM 
CaClj, 0.8 mM MgS04, 5.5 mM glucose) at room temperature. 

3. The assay buffer is decanted and 1 ml of assay buffer, containing 3 
5 /yCi/ml ®^Rb, with 5mM ouabain and agoinist or antagonist in a 

concentration to effect a maximum response, is added. 

4. The plate is incubated on ice at 1 "^C for 4 min. 

5. The buffer is decanted into a waste container and each well was 
washed with 3 ml of assay buffer, followed by two washes of 2 ml each. 

10 6. The cells are lysed with 2 x 0.5 ml of 0.2% SDS per well and 
transferred to a scintillation vial containing 5 ml of scintillation fluid. 
7. The radioactivity contained in each vial 5 is measured and the data 
calculated. Positive control cells provided the following data in this assay: 
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*CCh = carbamyicholine 
^ 200//M nicotine 
^ 300//M CCh 
3mM CCh 
5 ^ ImM cytisine 

® 1 00 fjM cytisine 
^ nd = not determined 

D. Electrophysiological Analysis of Mammalian Cells 
Transfected with Human Neuronal nAChR Subunit- 
10 encoding DNA 

Eiectrophysioioglcal measurements may be used to assess the 

activity of recombinant receptors or to assess the ability of a test 

compound to potentiate, antagonize or otherwise modulate the magnitude 

and duration of the flow of cations through the ligand-gated recombinant 

15 nAChR. The function of the expressed neuronal nAChR can be assessed 
by a variety of electrophysiological techniques, including two-electrode 
voltage clamp and patch clamp methods. The cation-conducting channel 
intrinsic to the nAChR opens in response to acetylcholine (ACh) or other 
nicotinic cholinergic agonists, permitting the flow of transmembrane 

20 current carried predominantly by sodium and potassium ions under 
physiological conditions. This current can be monitored directly by 
voltage clamp techniques. In preferred embodiments, transfected 
mammalian cells or injected oocytes are analyzed eiectrophysiologically 
for the presence of nAChR agonist-dependent currents. 

25 E. Fluroescent Indicator-Based Assays 

Activation of the ligand-gated nAChR by agonists leads to an influx 
of cations, including Ca*^, through the receptor channel. Ca** entry into 
the cell through the channel can induce release of calcium contained in 
intracellular stores. Monovalent cation entry into the cell through the 

30 channel can also result in an increase in cytoplasmic Ca^'*^ levels through 
depolarization of the membrane and subsequent activation of voltage- 
dependent calcium channels. Therefore, methods of detecting transient 
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increases in Intracellular calcium concentration can be applied to the 
analysis of functional nicotinic nAChR expression. One method for 
measuring intracellular calcium levels relies on calcium-sensitive 
fluorescent indicators. 
5 Calcium-sensitive indicators, such as fluo-3 (Catalog No. F01241, 

Molecular Probes, Inc., Eugene, OR), are available as acetoxymethyl 
esters which are membrane permeable. When the acetoxymethyl ester 
form of the indicator enters a cell, the ester group is removed by 
cytosolic esterases, thereby trapping the free indicator in the cytosol. 

10 Interaction of the free indicator with calcium results in increased 

fluorescence of the indicator; therefore, an increase in the intracellular 
Ca^^ concentration of cells containing the indicator can be expressed 
directly as an increase in fluorescence. An automated fluorescence 
detection system for assaying nicotinic nAChR has been described (see, 

15 U.S. Patent Application Serial Nos. 08/229,150, 08/244,985, 

08/434,511, and 08/434,968 and corresponding published International 
PCT Patent Application No. US92/1 1090; see, also, published 
International PCT application No. 96/05488). 

HEK cells that are transiently or stably co-transfected with DMA 

20 encoding appropriate a and/or ^ subunits and and ^ffg subunits are 
analyzed for expression of functional recombinant nAChR using the 
automated fluorescent indicator-based assay. The assay procedure is as 
follows. Untransfected HEK cells and HEK cells co-transfected with DNA 
encoding the appropriate a and ^ subunits are plated in the wells of a 96- 

25 well microtiter dish and loaded with fluo-S by incubation for 2 hours at 
20''C in a medium containing 20 //M fluo-3, 0.2% Pluronic F-127 in HBS 
(125 mM NaCI, 5 mM KCI, 1.8 mM CaClj, 0.62 mM MgSO^, 6 mM 
glucose, 20 mM HEPES, pH 7.4). The cells are then washed with assay 
buffer (Le^, HBS). The antagonist d-tubocurarine is added to some of the 
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wells at a final concentration of 10 /jM. The microtiter dish is then 
placed into a fluorescence plate reader and the basal fluorescence of each 
well is measured and recorded before addition of agonist e.g. , 200 fjM 
nicotine, to the wells. The fluorescence of the wells is monitored 
5 repeatedly during a period of approximately 60 seconds following addition 
of nicotine. 

The fluorescence of the untransfected HEK cells does not change 
after addition of nicotine. In contrast, the fluorescence of the co- 
transfected cells, in absence of d-tubocurarine, increases dramatically 
10 after addition of nicotine to the wells. This nicotine-stimulated increase 
in fluorescence is not observed in co-transfected cells that had been 
exposed to the antagonist d-tubocurarine. Such results demonstrate that 
the co-transfected cells express functional recombinant nAChR that are 
activated by nicotine and blocked by d-tubocurarine. 

15 

While the invention has been described in detail with reference to 
certain preferred embodiments thereof, it will be understood that 
modifications and variations are within the spirit and scope of that which 
is described and claimed. 
20 Since modifications will be apparent to those of skill in this art, it is 

intended that this invention be limited only by the scope of the appended 
claims. 
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SEQUENCE LISTING 

GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: SIBIA NEUROSCIENCES , INC. 

(B) STREET: 505 S. Coast Blvd. 

(C) CITY: La Jolla 

(D) STATE: California 

(E) COUNTRY: USA 

(F) POSTAL CODE (ZIP) : 92037 

(i) INVENTOR/APPLICANT: 

(A) NAME: Kathryn J. Elliott 

(B) STREET: 3854 Baker Street 

(C) CITY: San Diego 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92117 

(i) INVENTOR/ APPLICANT: 

(A) NAME: Michael M. Harpold 

(B) STREET: 15630 Creek Hills Road 

(C) CITY: El Cajon 

(D) STATE: California 

(D) COUNTRY: USA 

(E) POSTAL CODE (ZIP) : 92021 

(ii) TITLE OF INVENTION: HUMAN NEURONAL NICOTINIC ACETYLCHOLINE 
RECEPTOR COMPOSITIONS AND METHODS EMPLOYING SAME 

(iii) NUMBER OF SEQUENCES: 20 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Brown, Martin, Haller &. McClain 

(B) STREET: 1660 Union Street 

(C) CITY: San Diego 

(D) STATE: CA 

(E) COUNTRY: USA 

(F) ZIP: 92101-2926 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 1.5 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: June 7, 1996 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/484 , 722 

(B) FILING DATE: 06/07/95 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Seidman, Stephanie L 

(B) REGISTRATION NUMBER: 33,779 

(C) REFERENCE/DOCKET NUMBER: 6362-9370PC 
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(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 619-238-0999 

(B) TELEFAX: 619-238-0062 

(C) TELEX: 



(2) INFOHMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2664 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRA(»1ENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 555... 2141 

(D) OTHER INFORMATION: alpha2 subunit of human neuronal 

nicotinic acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1. . .554 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 2142... 2666 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GAGAGAACAG CGTGAGCCTG TGTGCTTGTG TGCTGAGCCC TCATCCCCTC CTGGGGCCAG 60 

GCTTGGGTTT CACCTGCAGA ATCGCTTGTG CTGGGCTGCC TGGGCTGTCC TCAGTGGCAC 120 

CTGCATGAAG CCGTTCTGGC TGCCAGAGCT GGACAGCCCC AGGAAAACCC ACCTCTCTGC 180 

AGAGCTTGCC CAGCTGTCCC CGGGAAGCCA AATGCCTCTC ATGTAAGTCT TCTGCTCGAC 240 

GGGGTGTCTC CTAAACCCTC ACTCTTCAGC CTCTGTTTGA CCATGAAATG AAOTGACTGA 300 

GCTCTATTCT GTACCTGCCA CTCTATTTCT GGGGTGACTT TTGTCAGCTG CCCAGAATCT 360 

CCAAGCCAGG CTGGTTCTCT GCATCCTTTC AATGACCTGT TTTCTTCTGT AACCACAGGT 420 

TCGGTGGTGA GAGGAAGCCT CGCAGAATCC AGCAGAATCC TCACAGAATC CAGCAGCAGC 480 

TCTGCTGGGG ACATGGTCCA TGGTGCAACC CACAGCAAAG CCCTGACCTG ACCTCCTGAT 540 

GCTCAGGAGA AGCC ATG GGC CCC TCC TGT CCT GTG TTC CTG TCC TTC ACA 590 
Met Gly Pro Ser Cys Pro Val Phe Leu Ser Phe Thr 
15 10 

AAG CTC AGC CTG TGG TGG CTC CTT CTG ACC CCA GCA GGT GGA GAG GAA 638 
Lys Leu Ser Leu Trp Trp Leu Leu Leu Thr Pro Ala Gly Gly Glu Glu 
15 20 25 

GCT AAG CGC CCA CCT CCC AGG GCT CCT GGA GAC CCA CTC TCC TCT CCC 686 
Ala Lys Arg Pro Pro Pro Arg Ala Pro Gly Asp Pro Leu Ser Ser Pro 
30 35 40 



AGT CCC ACG GCA TTG CCG CAG GGA GGC TCG CAT ACC GAG ACT GAG GAC 
Ser Pro Thr Ala Leu Pro Gin Gly Gly Ser His Thr Glu Thr Glu Asp 



734 
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45 50 55 . 60 

CGG CTC TTC AAA CAC CTC TTC CGG GGC TAC AAC CGC TGG GCG CGC CCG 782 
Arg I*eu Phe Lys His Leu Phe Arg Gly Tyr Asn Arg Trp Ala Arg Pro 
65 70 75 

GTG CCC AAC ACT TCA GAC GTG GTG ATT GTG CGC TTT GGA CTG TCC ATC 830 
Val Pro Asn Thr Ser Asp Val Val lie Val Arg Phe Gly Leu Ser lie 
80 85 90 

OCT CAG CTC ATC GAT GTG GAT GAG AAG AAC CAA ATG ATG ACC ACC AAC 878 
Ala Gin Leu lie Asp Val Asp Glu Lys Asn Gin Met Met Thr Thr Asn 
95 100 105 

GTC TGG CTA AAA CAG GAG TGG AGC GAC TAC AAA CTG CGC TGG AAC CCC 926 
Val Trp Leu Lys Gin Glu Trp Ser Asp Tyr Lys Leu Arg Trp Asn Pro 
110 115 120 

OCT GAT TTT GGC AAC ATC ACA TCT CTC AGG GTC CCT TCT GAG ATG ATC 974 
Ala Asp Phe Gly Asn He Thr Ser Leu Arg Val Pro Ser Glu Met lie 
125 130 135 140 

TGG ATC CCC GAC ATT GTT CTC TAC AAC AAT GCA GAT GGG GAG TTT GCA 1022 
Trp He Pro Asp He Val Leu Tyr Asn Asn Ala Asp Gly Glu Phe Ala 
145 150 155 

GTG ACC CAC ATG ACC AAG GCC CAC CTC TTC TCC ACG GGC ACT GTG CAC 1070 
Val Thr His Met Thr Lys Ala His Leu Phe Ser Thr Gly Thr Val His 
160 165 170 

TGG GTG CCC CCG GCC ATC TAC AAG AGC TCC TGC AGC ATC GAC GTC ACC 1118 
Trp Val Pro Pro Ala He Tyr Lys Ser Ser Cys Ser He Asp Val Thr 
175 180 185 

TTC TTC CCC TTC GAC CAG CAG AAC TGC AAG ATG AAG TTT GGC TCC TGG 1166 
Phe Phe Pro Phe Asp Gin Gin Asn Cys Lys Met Lys Phe Gly Ser Trp 
. 190 195 200 

ACT TAT GAC AAG GCC AAG ATC GAC CTG GAG CAG ATG GAG CAG ACT GTG 1214 
Thr Tyr Asp Lys Ala Lys He Asp Leu Glu Gin Met Glu Gin Thr Val 
205 210 215 220 

GAC CTG AAG GAC TAC TOG GAG AGC GGC GAG TGG GCC ATC GTC AAT GCC 1262 
Asp Leu Lys Asp Tyr Trp Glu Ser Gly Glu Trp Ala He Val Asn Ala 
225 230 235 

ACG GGC ACC TAC AAC AGC AAG AAG TAC GAC TGC TGC GCC GAG ATC TAC 1310 
Thr Gly Thr Tyr Asn Ser Lys Lys Tyr Asp Cys Cys Ala Glu He Tyr 
240 245 250 

CCC GAC GTC ACC TAC GCC TTC GTC ATC CGG CGG CTG CCG CTC TTC TAC 1358 
Pro Asp Val Thr Tyr Ala Phe Val He Arg Arg Leu Pro Leu Phe Tyr 
255 260 265 

ACC ATC AAC CTC ATC ATC CCC TGC CTG CTC ATC TCC TGC CTC ACT GTG 1406 
Thr He Asn Leu He He Pro Cys Leu Leu He Ser Cys Leu Thr Val 
270 275 280 

CTG GTC TTC TAC CTG CCC TCC GAC TGC GGC GAG AAG ATC ACG CTG TGC 1454 
Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys He Thr Leu Cys 
285 290 295 300 
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ATT TCG GTG CTG CTG TCA CTC ACC GTC TTC CTG CTG CTC ATC ACT GAG 1502 
lie Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu Leu lie Thr Glu 
305 310 315 

ATC ATC CCG TCC ACC TCG CTG GTC ATC CCG CTC ATC GGC GAG TAG CTG 1550 
lie lie Pro Ser Thr Ser Leu Val He Pro Leu He Gly Glu Tyr Leu 
320 325 330 

CTG TTC ACC ATG ATC TTC GTC ACC CTG TCC ATC GTC ATC ACC GTC TTC 1598 
Leu Phe Thr Met He Phe Val Thr Leu Ser He Val He Thr Val Phe 
335 340 345 

GTG CTC AAT GTG CAC CAC CGC TCC CCC AGC ACC CAC ACC ATG CCC CAC 1646 
Val Leu Asn Val His His Arg Ser Pro Ser Thr His Thr Met Pro His 
350 355 360 

TGG GTG CGG GGG GCC CTT CTG GGC TGT GTG CCC CGG TGG CTT CTG ATG 1694 
Trp Val Arg Gly Ala Leu Leu Gly Cys Val Pro Arg Trp Leu Leu Met 
365 370 375 380 

AAC CGG CCC CCA CCA CCC GTG GAG CTC TGC CAC CCC CTA CGC CTG AAG 1742 
Asn Arg Pro Pro Pro Pro Val Glu Leu Cys His Pro Leu Arg Leu Lys 
385 390 395 

CTC AGC CCC TCT TAT CAC TGG CTG GAG AGC AAC GTG GAT GCC GAG GAG 1790 
Leu Ser Pro Ser Tyr His Trp Leu Glu Ser Asn Val Asp Ala Glu Glu 
400 405 410 

AGG GAG GTG GTG GTG GAG GAG GAG GAC AGA TGG GCA TGT GCA GGT CAT 1838 
Arg Glu Val Val Val Glu Glu Glu Asp Arg Trp Ala Cys Ala Gly His 
415 420 425 

GTG GCC CCC TCT GTG GGC ACC CTC TGC AGC CAC GGC CAC CTG CAC TCT 1886 
Val Ala Pro Ser Val Gly Thr Leu Cys Ser His Gly His Leu His Ser 
430 435 440 

GGG GCC TCA GGT CCC AAG GCT GAG GCT CTG CTG CAG GAG GGT GAG CTG 1934 
Gly Ala Ser Gly Pro Lys Ala Glu Ala Leu Leu Gin Glu Gly Glu Leu 
445 450 455 460 

CTG CTA TCA CCC CAC ATG CAG AAG GCA CTG GAA GGT GTG CAC TAC ATT 1982 
Leu Leu Ser Pro His Met Gin Lys Ala Leu Glu Gly Val His Tyr He 
465 470 475 

GCC GAC CAC CTG CGG TCT GAG GAT GCT GAC TCT TCG GTG AAG GAG GAC 2030 
Ala Asp His Leu Arg Ser Glu Asp Ala Asp Ser Ser Val Lys Glu Asp 
480 485 490 

TGG AAG TAT GTT GCC ATG GTC ATC GAC AGG ATC TTC CTC TGG CTG TTT 2078 
Trp Lys Tyr Val Ala Met Val He Asp Arg He Phe Leu Trp Leu Phe 
495 500 505 

ATC ATC GTC TGC TTC CTG GGG ACC ATC GGC CTC TTT CTG CCT CCG TTC 2126 
He He Val Cys Phe Leu Gly Thr He Gly Leu Phe Leu Pro Pro Phe 
510 515 520 

CTA GCT GGA ATG ATC TGACTGCACC TCCCTCGAGC TGGCTCCCAG GGCAAAGGGG AG 2183 

Leu Ala Gly Met He 

525 



GGTTCTTGGA TGTGGAAGGG CTTTGAACAA TGTTTAGATT TGGAGATGAG CCCAAAGTGC 2243 
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CAGGGAGAAC AGCCAGGTGA GGTGGGAGGT TGGAGAGCCA GGTGAGGTCT CTCTAAGTCA 2303 

GGCTGGGGTT GAAGTTTGGA GTCTGTCCGA GTTTGCAGGG TGCTGAGCTG TATGGTCCAG 2363 

CAGGGGAGTA ATAAGGGCTC TTCCGGAAGG GGA66AA6CG GGAG6CAGGC CTGCACCTGA 2423 

TGTGGAGGTA CAGGCAGATC TTCCCTACC6 66GAGGGATG GATGGTTGGA TACA6GTGGC 2483 

TGGGCTATTC CATCCATCTG QAAGCACATT TGAGCCTCCA GGCTTCTCCT TGACGTCATT 2543 

CCTCTCCTTC CTTGCTGCAA AATGGCTCTG CACCAGCCGG CCCCCAGGAG GTCTGGCAGA 2603 

6CTGAGAGCC ATGGCCTGCA GGGGCTCCAT ATGTCCCTAC GCGT6CAGCA GGCAAACAAG 2663 

A 2664 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 529 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met 


Gly 


Pro 


Ser 


Cys 


Pro 


Val 


Phe 


1 








5 








Trp 


Trp 


Leu 


Leu 


Leu 


Thr 


Pro 


Ala 




20 










Pro 


Pro 


Arg 
35 


Ala 


Pro 


Gly 


Asp 


Pro 
40 


Leu 


Pro 
50 


Gin 


Gly 


Gly 


Ser 


His 
55 


Thr 


His 


Leu 


Phe 


Arg 


Gly 


Tyr 


Asn 


Arg 


65 










70 






Ser 


Asp 


Val 


Val 


He 


Val 


Arg 


Phe 








85 








Asp 


Val 


Asp 


Glu 
100 


Lys 


Asn 


Gin 


Met 


Gin 


Glu 


Trp 
115 


Ser 


Asp 


Tyr 


Lys 


Leu 
120 


Asn 


He 
130 


Thr 


Ser 


Leu 


Arg 


Val 
135 


Pro 


He 


Val 


Leu 


Tyr 


Asn 


Asn 


Ala 


Asp 


145 










150 






Thr 


Lys 


Ala 


His 


Leu 


Phe 


Ser 


Thr 








165 








Ala 


He 


Tyr 


Lys 
180 


Ser 


Ser 


Cys 


Ser 


Asp 


Gin 


Gin 
195 


Asn 


Cys 


Lys 


Met 


Lys 
200 


Ala 


Lys 
210 


He 


Asp 


Leu 


Glu 


Gin 
215 


Met 


Tyr 


Trp 


Glu 


Ser 


Gly 


Glu 


Trp 


Ala 


225 










230 






Asn 


Ser 


Lys 


Lys 


Tyr 
245 


Asp 


Cys 


Cys 


Tyr 


Ala 


Phe 


Val 
260 


He 


Arg 


Arg 


Leu 


He 


He 


Pro 


Cys 


Leu 


Leu 


He 


Ser 






275 








280 



Leu 


Ser 


Phe 


Thr Lys Leu Ser Leu 




10 




15 


Gly 


Gly 


Glu 


Glu Ala Lys Ajrg Pro 


25 






30 


Leu 


Ser 


Ser 


Pro Ser Pro Thr Ala 








45 


Glu 


Thr 


Glu 


Asp Arg Ijeu Phe Lys 








60 


Trp 


Ala 


Arg 


Pro Val Pro Asn Thr 






75 


80 


Gly 


Leu 


Ser 


He Ala Gin Leu He 




90 




95 


Met 


Thr 


Thr 


Asn Val Trp Leu Lys 


105 






110 


Arg 


Trp 


Asn 


Pro Ala Asp Phe Gly 








125 


Ser 


Glu 


Met 


He Trp He Pro Asp 








140 


Gly 


Glu 


Phe 


Ala Val Thr His Met 




155 


160 


Gly 


Thr 


Val 


His Trp Val Pro Pro 




170 




175 


He 


Asp 


Val 


Thr Phe Phe Pro Phe 


185 




190 


Phe 


Gly 


Ser 


Trp Thr Tyr Asp Lys 








205 


Glu 


Gin 


Thr 


Val Asp Leu Lys Asp 








220 


He 


Val 


Asn 


Ala Thr Gly Thr Tyr 






235 


240 


Ala 


Glu 


He 


Tyr Pro Asp Val Thr 




250 




255 


Pro 


Leu 


Phe 


Tyr Thr He Asn Leu 


265 






270 


Cys 


Leu 


Thr 


Val Leu Val Phe Tyr 



285 
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Leu 


Pro 


Ser 


Asp Cys Gly Glu Lys 




290 




295 


Leu 


Ser 


Leu 


Thr Val Phe Leu Leu 


305 






310 


Thr 


Ser 


Leu 


Val He Pro Leu He 








325 


lie 


Phe 


Val 


Thr Leu Ser He Val 








340 


His 


His 


Arg 


Ser Pro Ser Thr His 






355 


360 


Ala 


Leu 


Ijeu 


Gly Cys Val Pro Arg 




370 




375 


Pro 


Pro 


Val 


Glu Leu Cys His Pro 


385 






390 


Tvr 


His 




Leu Glu Ser Asn Val 






Val 


Glu 


Glu 


Glu Asn Aica Tim Ala 








420 


Val 


Gly 


Thr 


Leu Cys Ser His Gly 






435 


440 


Pro 


Lys 


Ala 


Glu Ala Leu Leu Gin 




450 




455 


His 


Met 


Gin 


Lys Ala Leu Glu Gly 


465 






470 


Arg 


Ser 


Glu 


Asp Ala Asp Ser Ser 








485 


Ala 


Met 


Val 


He Asp Arg He Phe 








500 


Phe 


Leu 


Gly 


Thr He Gly Leu Phe 






515 


520 


He 









He Thr 


Leu 


Cys He Ser Val 


Xieu 






300 




Leu He 


Thr 


Glu He He Pro 


Ser 




315 




320 


Gly Glu 


Tyr 


Leu Leu Phe Thr 


Met 


330 




335 




He Thr 


Val 


Phe Val Leu Asn 


Val 

vox 


345 




350 




Thr Met 


Pro 


His Trp Val Arg 


Glv 






365 




Trp Leu 


Leu 


Met Asn Arg Pro 


Pro 






380 




Leu Arg 


Leu 


Lys Leu Ser Pro 


Ser 




395 




400 


Asp Ala 


Glu 


Glu Arg Glu Val 


Val 


410 




415 




Cys Ala 


Gly 


His Val Ala Pro 


Ser 


425 




430 




His Leu 


His 


Ser Gly Ala Ser 


Gly 






445 




Glu Gly 


Glu 


Leu Leu Leu Ser 


Pro 






460 




Val His 


Tyr 


He Ala Asp His 


Leu 




475 




480 


Val Lys 


Glu 


Asp Trp Lys Tyr 


Val 


490 




495 




Leu Trp 


Leu 


Phe He He Val 


Cys 


505 




510 


Leu Pro 


Pro 


Phe Leu Ala Gly 


Met 



525 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1908 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES5 : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 190... 1704 

(D) OTHER INFORMATION: alpha3 subunit human neuronal 

nicotinic acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...189 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1705... 1908 
(D) OTHER INFORMATION: 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 

CCTGTCCTCC CGCGGGTCCG AGGGCGCTGG AAACCCAGCG GCG6CGAAGC GGAGAGGAGC 60 

CCCGCGCGTC TCCGCCCGCA CGGCTCCAGG TCTGGGGTCT GCGCTGGAGC CGCGCGGGGA 120 

GAGGCCGTCT CTGCGACCGC CGCGCCCGCT CCCGACCGTC CGGGTCCGCG GCCAGCCCGG 180 

CCACCAGCC ATG GGC TCT GGC COG CTC TCG CTG CCC CTG GCG CTG TCG COG 231 
Met Gly Ser Gly Pro Leu Ser Leu Pro Leu Ala Leu Ser Pro 
15 10 

CCG C6G CTG CTG CTG CTG CTG CTG TCT CTG CTG CCA GTG GCC AGG GCC 279 
Pro Arg Leu I^eu Leu Leu Leu Leu Ser Leu. Leu Pro Val Ala Arg Ala 
15 20 25 30 

TCA GAG GCT GAG CAC CGT CTA TTT GAG CGG CTG TTT GAA GAT TAG AAT 327 
Ser Glu Rla Glu His Arg Leu Phe Glu Arg Leu Phe Glu Asp Tyr Asn 
35 40 45 

GAG ATC ATC CGG CCT GTA GCC AAC GTG TCT GAC CCA GTC ATC ATC CAT 375 
Glu lie lie Arg Pro Val Ala Asn Val Ser Asp Pro Val He He His 
50 55 60 

TTC GAG GTG TCC ATG TCT CAG CTG GTG AAG GTG GAT GAA GTA AAC CAG 423 
Phe Glu Val Ser Met Ser Gin Leu Val Lys Val Asp Glu Val Asn Gin 
65 70 75 

ATC ATG GAG ACC AAC CTG TGG CTC AAG CAA ATC TGG AAT GAC TAG AAG 471 
He Met Glu Thr Asn Leu Trp Leu Lys Gin He Trp Asn Asp Tyr Lys 
80 85 90 

CTG AAG TGG AAC CCC TCT GAC TAT GGT GGG GCA GAG TTC ATG CGT GTC 519 
Leu Lys Trp Asn Pro Ser Asp Tyr Gly Gly Ala Glu Phe Met Arg Val 
95 100 105 110 

CCT GCA CAG AAG ATC TGG AAG CCA GAC ATT GTG CTG TAT AAC AAT GCT 567 
Pro Ala Gin Lys He Trp Lys Pro Asp He Val Leu Tyr Asn Asn Ala 
115 120 125 

GTT GGG GAT TTC CAG GTG GAC GAC AAG ACC AAA GCC TTA CTC AAG TAG 615 
Val Gly Asp Phe Gin Val Asp Asp Lys Thr Lys Ala Leu Leu Lys Tyr 
130 135 140 

ACT GGG GAG GTG ACT TGG ATA CCT CCG GCC ATC TTT AAG AGG TGG TGT 663 
Thr Gly Glu Val Thr Txp He Pro Pro Ala He Phe Lys Ser Ser Cys 
145 150 155 

AAA ATC GAC GTG ACC TAG TTC CCG TTT GAT TAG CAA AAC TGT ACC ATG 711 
Lys He Asp Val Thr Tyr Phe Pro Phe Asp Tyr Gin Asn Cys Thr Met 
160 165 170 

AAG TTC GGT TCC TGG TCC TAG GAT AAG GCG AAA ATG GAT GTG GTC CTG 759 
Lys Phe Gly Ser Trp Ser Tyr Asp Lys Ala Lys He Asp Leu Val Leu 
175 180 185 190 

ATC GGC TGT TGG ATG AAC CTG AAG GAC TAT TGG GAG AGC GGG GAG TGG 807 
He Gly Ser Ser Met Asn Leu Lys Asp Tyr Trp Glu Ser Gly Glu Trp 
195 200 205 



GCC ATC ATC AAA GCC 
Ala He He Lys Ala 



CCA GGG TAG AAA GAC GAC ATC AAG TAG AAC TGG 
Pro Gly Tyr Lys His Asp He Lys Tyr Asn Cys 



855 



wo 96/41876 



PCTAJS96/09775 



-55- 



210 215 220 

TGC GAG GAG ATC TAC CCC GAC ATC ACA TAC TCG CTG TAG ATC CGG CGC 903 
Cys Glu Glu lie Tyr Pro Asp lie Thr Tyr Ser Leu Tyr lie Arg Arg 
225 230 235 

CTG CCC TTG TTC TAC ACC ATC AAC CTC ATC ATC CCC TGC CTG CTC ATC 951 
Leu Pro Leu Phe Tyr Thr lie Asn Leu lie lie Pro Cys Leu Leu lie 
240 245 250 

TCC TTC CTC ACT GTG CTC GTC TTC TAC CTG CCC TCC GAC TGC GGT GAG 999 
Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu 
255 260 265 270 

AAG GTG ACC CTG TGC ATT TCT GTC CTC CTC TCC CTG ACG GTG TTT CTC 1047 
Lys Val Thr Leu Cys lie Ser Val Leu Leu Ser Leu Thr Val Phe Leu 
275 280 285 

CTG GTG ATC ACT GAG ACC ATC CCT TCC ACC TCG CTG GTC ATC CCC CTG 1095 
Leu Val He Thr Glu Thr He Pro Ser Thr Ser Leu Val He Pro Leu 
290 295 300 

ATT GGA GAG TAC CTC CTG TTC ACC ATG ATT TTT GTA ACC TTG TCC ATC 1143 
He Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser He 
305 310 315 

GTC ATC ACC GTC TTC GTG CTC AAC GTG CAC TAC AGA ACC CCG ACG ACA 1191 
Val He Thr Val Phe Val Leu Asn Val His Tyr Arg Thr Pro Thr Thr 
320 325 330 

CAC ACA ATG CCC TCA TGG GTG AAG ACT GTA TTC TTG AAC CTG CTC CCC 1239 
His Thr Met Pro Ser Trp Val Lys Thr Val Phe Leu Asn Leu Leu Pro 
335 340 345 350 

AGG GTC ATG TTC ATG ACC AGG CCA ACA AGC AAC GAG GGC AAC GCT CAG 1287 
Arg Val Met Phe Met Thr Arg Pro Thr Ser Asn Glu Gly Asn Ala Gin 
355 360 365 

AAG CCG AGG CCC CTC TAC GGT GCC GAG CTC TCA AAT CTG AAT TGC TTC 1335 
Lys Pro Arg Pro Leu Tyr Gly Ala Glu Leu Ser Asn Leu Asn Cys Phe 
370 375 380 

i 

AGC CGC GCA GAG TCC AAA GGC TGC AAG GAG GGC TAC CCC TGC CAG GAC 1383 
Ser Arg Ala Glu Ser Lys Gly Cys Lys Glu Gly Tyr Pro Cys Gin Asp 
385 390 395 

GGG ATG TGT GGT TAC TGC CAC CAC CGC AGG ATA AAA ATC TCC AAT TTC 1431 
Gly Met Cys Gly Tyr Cys His His Arg Arg He Lys He Ser Asn Phe 
400 405 410 

AGT GCT AAC CTC ACG AGA AGC TCT AGT TCT GAA TCT GTT GAT GCT GTG 1479 
Ser Ala Asn Leu Thr Arg Ser Ser Ser Ser Glu Ser Val Asp Ala Val 
415 420 425 430 

CTG TCC CTC TCT GCT TTG TCA CCA GAA ATC AAA GAA GCC ATC CAA AGT 1527 
Leu Ser Leu Ser Ala Leu Ser Pro Glu He Lys Glu Ala He Gin Ser 
435 440 445 

GTC AAG TAT ATT GCT GTUV AAT ATG AAA GCA CAA AAT GAA GCC AAA GAG 1575 
Val Lys Tyr He Ala Glu Asn Met Lys Ala Gin Asn Glu Ala Lys Glu 
450 455 460 
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ATT CAA GAT GAT TGG AAG TAT GTT GCC ATG GTG ATT GAT CGT ATT TTT 1623 
lie Gin Asp Asp Trp Lys Tyr Val Ala Met Val lie Asp Arg He Phe 
465 470 475 

CTG TGG GTT TTC ACC CTG GTG TGC ATT CTA GGG ACA GCA GGA TTG TTT 1671 
Leu Trp Val Phe Thr Leu Val Cys He Leu Gly Thr Ala Gly Leu Phe 
480 485 490 

CTG CAA CCC CTG ATG GCC AGG GAA GAT GCA TAA GCACTAAGCT GTGTGCCTGC 1724 
Leu Gin Pro Leu Met Ala Arg Glu Asp Ala * 
495 500 505 

CTGGGAGACT TCCTTGTGTC AGGGCAGGAG GAGGCTGCTT CCTAGTAAGA ACGTACTTTC 1784 

TGTTATCAAG CTACCAGCTT TGTTTTTGGC ATTTCGAGGT TTACTTATTT TCCACTTATC 1844 

TTGGAATCAT GCAAAAAAAA AATGTCAAGA GTATTTATTA CCGATAAATG AACATTTAAC 1904 

TAGC 1908 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 



Met 


Gly 


Ser 


Gly 


Pro 


Leu 


Ser 


Leu 


Pro 


Leu 


Ala 


Leu Ser Pro Pro Arg 


1 




5 










10 




15 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Ser 


Leu 


Leu 


Pro 


Val 


Ala Arg Ala Ser Glu 








20 










25 






30 


Ala 


Glu 


His 


Arg 


Leu 


Phe 


Glu 


Arg 


Leu 


Phe 


Glu 


Asp Tyr Asn Glu He 






35 










40 








45 


He 


Arg 


Pro 


Val 


Ala 


Asn 


Val 


Ser 


Asp 


Pro 


Val 


He He His Phe Glu 




50 










55 










60 


Val 


Ser 


Met 


Ser 


Gin 


Leu 


Val 


Lys 


Val 


Asp 


Glu 


Val Asn Gin He Met 


65 










70 










75 


80 


Glu 


Thr 


Asn 


Leu 


Trp 


Leu 


Lys 


Gin 


He 


Trp 


Asn 


Asp Tyr Lys Leu Lys 










85 










90 




95. 


Trp 


Asn 


Pro 


Ser 


Asp 


Tyr 


Gly 


Gly 


Ala 


Glu 


Phe 


Met Arg Val Pro Ala 






100 










105 






110 


Gin 


Lys 


He 


Trp 


Lys 


Pro 


Asp 


He 


val 


Leu 


Tyr 


Asn Asn Ala Val Gly 




115 










120 








125 


Asp 


Phe 


Gin 


Val 


Asp 


Asp 


Lys 


Thr 


Lys 


Ala 


Leu 


Leu Lys Tyr Thr Gly 


130 










135 










140 


Glu 


Val 


Thr 


Trp 


He 


Pro 


Pro 


Ala 


He 


Phe 


Lys 


Ser Ser Cys Lys He 


145 








150 










155 


160 


Asp 


Val 


Thr 


Tyr 


Phe 


Pro 


Phe 


Asp 


Tyr 


Gin 


Asn 


Cys Thr Met Lys Phe 






165 










170 




175 


Gly 


Ser 


Trp 


Ser 


Tyr 


Asp 


Lys 


Ala 


Lys 


He 


Asp 


Leu Val Leu He Gly 






180 










185 






190 


Ser 


Ser 


Met 


Asn 


Leu 


Lys 


Asp 


Tyr 


Trp 


Glu 


Ser 


Gly Glu Trp Ala He 






195 






200 








205 


He 


Lys 


Ala 


Pro 


Gly 


Tyr 


Lys 


His 


Asp 


He 


Lys 


Tyr Asn Cys Cys Glu 



210 215 220 
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Gill 


Tie 


Xyx ir^O Asp xxe XIlx xyx 


Ser 


Leu 


Tyr 


xxe nxg i\rg jjcu irro 






o "art 






oo c 


A 

240 


L6U 


oil A 


Tyr Thr He Asn Leu He 


He 


Pro 


Cys 


Leu Leu He Ser Phe 






245 




250 




255 




XIlx 


VclX JjeU VaX IrXie lyT ijcu 


Pro 


Ser 


Asp 


Cys Gly Glu Lys Val 






Of A 

2o0 


265 






270 




T 


\myo xxe oex vsij. ueu jjcu 


Ser 


Leu 


Jixir 


vax irne Xieu Xieu vax 






O T C O O A 

279 2oO 








285 


Tip 


Thr" 


V3XVL XUX XXC Xrxt,/ OCX AUX 


Ser 


Leu 


Vax 


Xxe irxo xisu xxe uxy 




o on 


2^3 








O A A 

300 


fit 14 


Tyr 


jjeu Lieu Fue xxir nec xxe 


Phe 


Val 


Thr 


Leu ser xxe vax xxe 


one 




O *l A 






315 


320 


IXlx 




jriie vdx jjeu Asn vax uxs 


Tyr Arg 


Tnr 


Fro Tnr Tnr nxs Tnr 






325 




330 




335 


nec 


Fro 


ser Trp Veii jjys TXir votx 


Phe 


Leu 


Asn 


Leu Leu Pro Arg Val 






A f% 
340 


345 






1 e n 

350 




TliT* Tiirrt 1*Vf^ Coy* 
neti ±x*x AX^ irxw xxix ocx 


Asn 


Glu 


G1 V 
uxy 


A en &1 s Gl n T.i^re 
ASn nXa VtXIl XiyS IrxO 






c c ^ £ A 

355 360 






•s ^ c 

3o5 




pro 


Leu Tyr Gly Ala Glu Leu 


Ser 


Asn 


Leu 


Asn Cys Phe Ser Arg 




370 


375 








380 


Ala 


Gill 


OCX Xiyo VjXjr ^jro Xjya \9XU 


Gly Tyr 


Pro 


C*\ra Gin Acsn Gl 

\^y3 vaxxi vixy pxet 


385 




390 






395 


400 


Cys 


Gly 


Tyr Cys His His Arg Arg 


He 


Lys 


He 


Ser Asn Phe Ser Ala 






405 




410 




415 


Asn 


Leu 


Thr Arg Ser Ser Ser Ser 


Glu 


Ser 


Val 


Asp Ala Val Leu Ser 






420 


425 






430 


Leu 


Ser 


Ala Leu Ser Pro Glu He 


Lys 


Glu 


Ala 


He Gin Ser Val Lys 






435 440 








445 


Tyr 


He 


Ala Glu Asn Met Lys Ala 


Gin 


Asn 


Glu 


Ala Lys Glu He Gin 




450 


455 








460 


Asp 


Asp 


Trp Lys Tyr Val Ala Met 


Val 


He 


Asp 


Arg He Phe Leu Trp 


465 




470 






475 


480 


Val 


Phe 


Thr Leu Val Cys He Leu 


Gly Thr 


Ala 


Gly Leu Phe Leu Gin 






485 




490 




495 


Pro 


Leu 


Met Ala Arg Glu Asp Ala 











500 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 232... 2115 

(D) OTHER INFORMATION: alpha4 s\abunit human neuronal 

nicotinic acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...231 
(D) OTHER INFORMATION: 
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(A) NAME/KEY: 3'UTR 

(B) LOCATION: 2116... 3496 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TCCCAGCCGG CTGAGGCGGG CAGGGCCGGG CGGGGCCGCG CCACGGAGTC CACAGCCCGG 60 
CGCTCCCTGC CGCGCCGCCG CCGCACCGCG CCCCACAGGA GAAGACGAAC CGGGCCCGGC 120 
GGCCGAAGCG GCCCGCGAGG CGCGGGAGGC ATGAAGTTGG GCGCGCACGG GCCTCGAAGC 180 
GGCGGGGAGC CGGGAGCCGC CCGCATCTAG A6CCCGCGAG GTGCGTGCGC C ATG GAG 237 

Met Glu 
1 

CTA GGG GGC CCC GGA GCG CCG CGG CTG CTG COG CCG CTG CTG CTG CTT 285 
Leu Gly Gly Pro Gly Ala Pro Arg Leu Leu Pro Pro Leu Leu Leu Leu 
5 10 15 

CTG GGG ACC GGC CTC CTG CGC GCC AGC AGC CAT GTG GAG ACC CGG GCC 333 
Leu Gly Thr Gly Leu Leu Arg Ala Ser Ser His Val Glu Thr Arg Ala 
20 25 30 

CAC GCC GAG GAG CGG CTC CTG AAG AAA CTC TTC TCC 6GT TAC AAC AAG 381 
His Ala Glu Glu Arg Leu Leu Lys Lys Leu Phe Ser Gly Tyr Asn Lys 
35 40 45 50 

TGG TCC CGA CCC GTG GCC AAC ATC TCG GAC GTG QTC CTC GTC CGC TTC 429 
Trp Ser Arg Pro Val Ala Asn lie Ser Asp Val Val Leu Val Arg Phe 
55 60 65 

GGC CTG TCC ATC GCT CAG CTC ATT GAC GTG GAT GAG AAG AAC CAG ATG 477 
Gly Leu Ser He Ala Gin Leu He Asp Val Asp Glu Lys Asn Gin Met 
70 75 80 

ATG ACC ACG AAC 6TA TGG GTG AAG CAG GAG TGG CAC GAC TAC AAG CTG 525 
Met Thr Thr Asn Val Trp Val Lys Gin Glu Trp His Asp Tyr Lys Leu 
85 90 95 

CGC TGG GAC CCA GCT GAC TAT GAG AAT GTC ACC TCC ATC CGC ATC CCC 573 
Arg Trp Asp Pro Ala Asp Tyr Glu Asn Val Thr Ser He Arg He Pro 
100 105 110 

TCC GAG CTC ATC TGG CGG CCG GAC ATC GTC CTC TAC AAC AAT GCT GAC 621 
Ser Glu Leu He Trp Arg Pro Asp He Val Leu Tyr Asn Asn Ala Asp 
115 120 125 130 

GGG GAC TTC GCG GTC ACC CAC CTG ACC AAG GCC CAC CTG TTC CAT GAC 669 
Gly Asp Phe Ala Val Thr His Leu Thr Lys Ala His Leu Phe His Asp 
135 . 140 145 

GGG CGG GTG CAG TGG ACT CCC CCG GCC ATT TAC AAG AGC TCC TGC AGC 717 
Gly Arg Val Gin Trp Thr Pro Pro Ala He Tyr Lys Ser Ser Cys Ser 
150 155 160 

ATC GAC GTC ACC TTC TTC CCC TTC GAC CAG CAG AAC TGC ACC ATG AAA 765 
He Asp Val Thr Phe Phe Pro Phe Asp Gin Gin Asn Cys Thr Met Lys 
165 170 175 

TTC GGC TCC TGG ACC TAC GAC AAG GCC AAG ATC GAC CTG GTG AAC ATG 813 
Phe Gly Ser Trp Thr Tyr Asp Lys Ala Lys He Asp Leu Val Asn Met 
180 185 190 
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CAC AGC CGC GTG GAC CAG CTG GAC TTC TGG GAG AGT GGC GAG TGG GTC 861 
His Ser Arg Val Asp Gin Leu Asp Phe Trp Glu Ser Gly Glu Trp Val 
195 200 205 210 

ATC GTG GAC GCC GTG GGC ACC TAC AAC ACC AGG AAG TAG GAG TGC TGC 909 
He Val Asp Ala Val Gly Thr Tyr Asn Thr Arg Lys Tyr Glu Cys Cys 
215 220 225 

GCC GAG ATC TAC CCG GAC ATC ACC TAT GCC TTC GTC ATC CGG CGG CTG 957 
Ala Glu He Tyr Pro Asp He Thr Tyr Ala Phe Val He Arg Arg Leu 
230 235 . 240 

CCG CTC TTC TAC ACC ATC AAC CTC ATC ATC CCC TGC CTG CTC ATC TCC 1005 
Pro Leu Phe Tyr Thr He Asn Leu He He Pro Cys Leu Leu He Ser 
245 250 255 

TGC CTC ACC GTG CTG GTC TTC TAC CTG CCC TCC GAG TGT GGC GAG AAG 1053 
Cys Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Glu Cys Gly Glu Lys 
260 265 270 

ATC ACQ CTG TGC ATC TCC GTG CTG CTG TCG CTC ACC GTC TTC CTG CTG 1101 
He Thr Leu Cys He Ser Val Leu Leu Ser Leu Thr Val Phe Leu Leu 
275 280 285 290 

CTC ATC ACC GAG ATC ATC CCG TCC ACC TCA CTG GTC ATC CCA CTC ATC 1149 
Leu He Thr Glu He He Pro Ser Thr Ser Leu Val He Pro Leu He 
295 300 305 

GGC GAG TAC CTG CTG TTC ACC ATG ATC TTC GTC ACC CTG TCC ATC GTC 1197 
Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val Thr Leu Ser He Val 
310 315 320 

ATC ACG GTC TTC GTG CTC AAC GTG CAC CAC CGC TCG CCA CGC ACG CAC 1245 
He Thr Val Phe Val Leu Asn Val His His Arg Ser Pro Arg Thr His 
325 330 335 

ACC ATG CCC ACC TGG GTA CGC AGG GTC TTC CTG GAC ATC GTG CCA CGC 1293 
Thr Met Pro Thr Trp Val Arg Arg Val Phe Leu Asp He Val Pro Arg 
340 345 350 

CTG CTC CTC ATG AAG CGG CCG TCC GTG GTC AAG GAC AAT TGC CGG CGG 1341 
Leu Leu Leu Met Lys Arg Pro Ser Val Val Lys Asp Asn Cys Arg Arg 
355 360 365 370 

CTC ATC GAG TCC ATG CAT AAG ATG GCC AGT GCC CCG CGC TTC TGG CCC 1389 
Leu He Glu Ser Met His Lys Met Ala Ser Ala Pro Arg Phe Trp Pro 
375 380 385 

GAG CCA GAA GGG GAG CCC CCT GCC ACG AGC GGC ACC CAG AGC CTG CAC 1437 
Glu Pro Glu Gly Glu Pro Pro Ala Thr Ser Gly Thr Gin Ser Leu His 
390 395 400 

CCT CCC TCA CCG TCC TTC TGC GTC CCC CTG GAT GTG CCG GCT GAG CCT 1485 
Pro Pro Ser Pro Ser Phe Cys Val Pro Leu Asp Val Pro Ala Glu Pro 
405 410 415 

GGG CCT TCC TGC AAG TCA CCC TCC GAC CAG CTC CCT CCT CAG CAG CCC 1533 
Gly Pro Ser Cys Lys Ser Pro Ser Asp Gin Leu Pro Pro Gin Gin Pro 
420 425 430 

CTG GAA GCT GAG AAA GCC AGC CCC CAC CCC TCG CCT GGA CCC TGC CGC 1581 
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Leu Glu Ala Glu Lys Ala Ser Pro His Pro Ser Pro Gly Pro Cys Arg 
435 440 445 450 



CCG CCC CAC GGC ACC CAG GCA CCA GGG CTG GCC AAA GCC AGG TCC CTC 
Pro Pro His Gly Thr Gin Ala Pro Gly Leu Ala Lys Ala Arg Ser Leu 
455 460 465 



1629 



AGC GTC CAG CAC ATG TCC AGC CCT GGC GAA GCG GTG GAA GGC GGC GTC 
Ser Val Gin His Met Ser Ser Pro Gly Glu Ala Val Glu Gly Gly Val 
470 475 480 



1677 



CGG TGC CGG TCT CGG AGC ATC CAG TAC TGT GTT CCC CGA 6AC GAT GCC 
Arg Cys Arg Ser Arg Ser lie Gin Tyr Cys Val Pro Arg Asp Asp Ala 
485 490 495 



1725 



GCC CCC GAG GCA GAT GGC CAG GCT GCC GGC GCC CTG GCC TCT CGC AAC 
Ala Pro Glu Ala Asp Gly Gin Ala Ala Gly Ala Leu Ala Ser Arg Asn 
500 505 510 



1773 



ACC CAC TCG GCT GAG CTC CCA CCC CCA GAC CAG CCC TCT CCG TGC AAA 
Thr His Ser Ala Glu Leu Pro Pro Pro Asp Gin Pro Ser Pro Cys Lys 
515 520 525 530 



1821 



TGC ACA TGC AAG AAS GAG CCC TCT TCG GTG TCC CCG AGC GCC ACG GTC 
Cys Thr Cys Lys Lys Glu Pro Ser Ser Val Ser Pro Ser Ala Thr Val 
535 540 545 



1869 



AAG ACC CGC AGC ACC AAA GCG CCG CCC CCG CAC CTG CCC CTG TCG CCG 
Lys Thr Arg Ser Thr Lys Ala Pro Pro Pro His Leu Pro Leu Ser Pro 
550 555 560 



1917 



GCC CTG ACC CGG GCG GTG GAG GGC GTC CAG TAC ATT GCA GAC CAC CTG 
Ala Leu Thr Arg Ala Val Glu Gly Val Gin Tyr He Ala Asp His Leu 
565 570 575 



1965 



AAG GCC GAA GAC ACA GAC TTC TCG GTG AAG GAG GAC TGG AAG TAC GTG 
Lys Ala Glu Asp Thr Asp Phe Ser Val Lys Glu Asp Trp Lys Tyr Val 
580 585 590 



2013 



GCC ATG GTC ATC GAC CGC ATC TTC CTC TGG ATG TTC ATC ATC GTC TGC 
Ala Met Val He Asp Arg He Phe Leu Trp Met Phe He He Val Cys 
595 600 605 610 



2061 



CTG CTG GGG ACG GTG GGC CTC TTC CTG CCG CCC TGG CTG GCT GGC ATG 
Leu Leu Gly Thr Val Gly Leu Phe Leu Pro Pro Trp Leu Ala Gly Met 
615 620 625 



2109 



ATC TAG GAAGGGACCG GGAGCCTGCG TGGCCTGGGG CTGCCGTGCA CGGGGCCAGC ATC 
He * 



2168 



CATGCGGCCG 
CCAAATTTTC 
CTGGGGAGAC 
TGGAGCAGAG 
CCCCTGCGCA 
ACTCTGCCTT 
TAGTGGCTTC 
GAATGGAGTC 
CGCAGCCCGG 
AGCTGTGGGG 



GCCTGGGGCC 
CTTCCTGTTC 
CGAGTGTGGA 
GTGGGGGTCG 
GCCCTCCGCG 
CCAGGCGTAG 
AGCCCCTGGG 
CAGACCTGGG 
GAGATCCGCC 
AGCTGGTGGG 



GGGCTGGCTT 
TGTGTCTGCT 
GCTGCTTCCA 
CCGCCTTCTA 
GCGGACAGGA 
GGCCAGGGCT 
GTACTTCTGT 
CCCTGGTTCC 
CTGGGCTCTG 
GGCATCCTTG 



CTCCCTGGAC 
GTAAGACGGC 
GTTGGACTGT 
CCTGCAGGAC 
ACACCAGCCC 
CTGGCAGGTG 
GTTGTGATTC 
CCCAGGACCC 
6GTTCGGGAA 
AGGACCTCCA 



TCTGTGGGGC 
CTTGGACGGG 
GGCCTCAGGA 
TCGGGCTAAG 
CAGCGAGTCT 
GCCAGGGCTC 
CCCGGAGCTG 
TGAGGGTTTC 
GAAGGACTTC 
CCTGGGAGAT 



CACACGTTTG 
GACACGGCCT 
GGCAGTGGCT 
TCCAGCTCTC 
GGAGACCAGG 
CACGGGGGGC 
GGAAGGTCCC 
CACCTTGGCG 
CTGCTACAGT 
GCTGGGACCC 



2228 
2288 
2348 
2408 
2468 
2528 
2588 
2648 
2708 
2768 
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TCGGGGCAGG AAGTCCCTGA GAAGCCTCAT GGGAGTCAGG GAGCCCTGGG GTTTCCACAC 2828 

AGGCCCATGC CCTCCGTCCT GGCAGGGCAG GCAGAGCTCA GCACAGCCTC ACCCCTGCAG 2888 

GCGGTATCCA GAGGTGAGGG AGGCCTGAAA TGTTTCCAGG CATGACCCTG GAGCCCGGCA 2948 

GTGCACCCCC TAAAGATGGC GCACCCGGCA GCCCCCCATT GTCCCCAGGG GCACACTTCC 3008 

CCCTTGGGAT GGGCACAGCC TGCCCCACCC CTCCATGATT CCAAGGGCCA AGAQGGGCGG 3068 

GGCCAGGATG GCTTTTCCCC TGCCTGTGAQ TGACATCGGT TCAGGAGGAG ACAGTCAGGA 3128 

AGCCTCCTGC TGAGTGGTCC ACATTCTGCT GCCCCCAQAC CCCATCCAGC CAGGGGTGGG 3188 

GATGGGGTTG GGCTCTGCGT CCCACTGAGT CTCATTCCTC TGTCCCCGAG CCGAGCTCTC 3248 

CTGGGCCAGG GTCTCGTCAG GAGGTGCCTG AGAGCAGAAT GAATAATTGA GGTTAGGAAC 3308 

CCGGCATGCC 6AGTGCCCCA GAAATGCCGC TGTGTNCCCC GCGGGCAGTG ACGTGAGTGG 3368 

GGAGGAGACT CAGGCCCACA TTGCCCACAC CTGCCTCTGA ACTGCTGCTG GTCACCCCCA 3428 

CCCCCGGGTG CCTGTGACCG GGGTCCTGAG GCTGGGGCTT TTGTGCCAGG AGTGGGTGGG 3488 

ACACAGAG 3495 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 628 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 





(vi) ORIGINAL 


SOURCE: 


















(xi) SEQUENCE 


DESCRIPTION; 


: SEQ ID 


N0:6: 






Met 


Glu 


Leu 


Gly 


Gly 


Pro 


Gly 


Ala 


Pro 


Arg 


Leu 


Leu Pro Pro 


Leu 


Leu 


1 








5 










10 






15 




Leu 


Leu 


Leu 


Gly 


Thr 


Gly 


Leu 


Leu 


Arg 


Ala 


Ser 


Ser His Val 


Glu 


Thr 








20 










25 






30 






Arg 


Ala 


His 


Ala 


Glu 


Glu 


Arg 


Leu 


Leu 


Lys 


Lys 


Leu Phe Ser Gly Tyr 






35 










40 








45 






Asn 


Lys 


Trp 


Ser 


Arg 


Pro 


Val 


Ala 


Asn 


He 


Ser 


Asp Val Val 


Leu 


Val 




50 










55 










60 






Arg 


Phe 


Gly 


Leu 


Ser 


He 


Ala 


Gin 


Leu 


He 


Asp 


Val Asp Glu 


Lys 


Asn 


65 










70 










75 






80 


Gin 


Met 


Met 


Thr 


Thr 


Asn 


Val 


Trp 


Val 


Lys 


Gin 


Glu Trp His 


Asp 


Tyr 










85 










90 






95 




Lys 


Leu 


Arg 


Trp 


Asp 


Pro 


Ala 


Asp 


Tyr 


Glu 


Asn 


Val Thr Ser 


He 


Arg 








100 










105 






110 






He 


Pro 


Ser 


Glu 


Leu 


He 


Trp 


Arg 


Pro 


Asp 


He 


Val Leu Tyr 


Asn 


Asn 






115 










120 








125 






Ala 


Asp 


Gly 


Asp 


Phe 


Ala 


Val 


Thr 


His 


Leu 


Thr 


Lys Ala His 


Leu 


Phe 




130 










135 










140 






His 


Asp 


Gly 


Arg 


Val 


Gin 


Trp 


Thr 


Pro 


Pro 


Ala 


He Tyr Lys 


Ser 


Ser 


145 










150 










155 






160 


Cys 


Ser 


He 


Asp 


Val 


Thr 


Phe 


Phe 


Pro 


Phe 


Asp 


Gin Gin Asn 


Cys 


Thr 










165 










170 






175 




Met 


Lys 


Phe 


Gly 


Ser 


Trp 


Thr 


Tyr 


Asp 


Lys 


Ala 


Lys He Asp 


Leu 


Val 








180 










185 






190 






Asn 


Met 


His 


Ser 


Arg 


Val 


Asp 


Gin 


Leu 


Asp 


Phe 


Trp Glu Ser Gly Glu 






195 










200 








205 






Trp 


Val 


He 


Val 


Asp 


Ala 


Val 


Gly 


Thr 


Tyr 


Asn 


Thr Arg Lys 


Tyr 


Glu 




210 










215 










220 






Cys 


Cys 


Ala 


Glu 


He 


Tyr 


Pro 


Asp 


He 


Thr 


Tyr 


Ala Phe Val 


He 


Arg 


225 










230 










235 


1 




240 


Arg 


Leu 


Pro 


Leu 


Phe 


Tyr 


Thr 


He 


Asn 


Leu 


He 


He Pro Cys 


Leu 


Leu 



wo 96/41876 



PCT/US96/09775 



-62- 









245 










250 


255 


He 


Ser 


Cys 


Leu Thr 


Val 


Leu 


Val 


Phe 


Tyr Leu 


Pro Ser Glu Cva Glv 








260 








265 




270 

AIM 


Glu 


Lvs 


He 


Thr Leu 


Cvs 


He 


Ser 


Val 




QpT* T.aii Tlir* Val P>io 
OCX ueu xiix vax IrXlC 






275 








280 






^03 


Leu 


Leu 


Leu 


He Thr 


Glu 


He 


He 


Pro 


OCX XllX 


Cav T^ii Val Tl a P%*#h 
OCX iJcU VaX XXc irXO 




290 
















1 flft 
J UU 


Leu. 


He 


Glv 


Glu Tyr 


Leu 


Leu 






Met He 


PVia Yfal 'P^v* T AY* Cav> 

iriie vax mr jueu ser 


305 








310 










<9 n 
320 


He 


Val 


He 


Thr Val 


Phe 


Val 




A fin 


Val TTi fi 
vax Axs 


V*! fi 3iv*rT Cav* lkv>,M 

Axs Axg OCX fro Arg 




















*3 ^ e 
j33 


Thr 


His 


Thr 


Met Pro 


Thr 




Val 




Arg Val 


PVta T.aii Abti TIa Val 
xrxie jjcu Asp XXc vax 








340 








3A5 
•943 




J30 


Pro 




Leu 


Leu Leu 


Met 




iixg 


PvA 


OCX Vox 


VaX LiyS iiSp ASu WyS 






355 








3ftn 






003 


Arcr 


Arci 


Leu 


He Glu 


Ser 


Met 


His 


T.vei 


Maf^ Ala 
t'iC AX Ol 


Cav &1 a P^/^ &v>rr PVia 
OCX AXa irxO nxg <rllc 




370 








375 










Pro 


Glu 


Pro Glu 


Glv 


Glu 






2Vl a Tlrf* 

AXd X XIX 


Cav fll XT 'PViv* Al n Gav* 

OCX vaxy xjuLx vtxxi ocr 


385 








^7 V 








J73 


400 


Leu 


His 


Pro 


Pro Ser 




OCX 


PHa 


cys 


vax irxO 


jjeu Asp vax jpro ax a 








405 








ft 

4XU 


4X3 


Glu 


Pro 


Glv 


Pro Ser 






ser 


Pro 


Ser Asp 


^! 1 1*1 T_Avi T)w*^^ ^^M»afc W% 

\3Xii lieu Fro rro ivxn 
















A*) e 
423 


430 


Gin 




XJCU 




UXU 


Lys 


iixa 


Ser 


Pro His 


Pro Ser Pro Gly Pro 














AAA 






445 








PT-n His 


Glv 
oxy 


XilX 


Gl n 


211 a 

nxa 


Pro Gly 


ijeu Axa jjys Axa Arg 


















460 


OCX 


<Ut5U 




V AX OXU 


n-i a 

AX 9 




Ser 


oer 


Pro Gly 


vsxu Axa vax gxu Gxy 


















475 


480 




veix 


Arg 


Cys Arg 


Ser 


Arg 


Ser 


x±e 


Gin Tyr 


Cys Val Pro Arg Asp 








>1 o c 










490 


495 


Asp 




Axa 


Pro Glu 


Aj.a 


ASp 


Qxy 


Gin 


Ala Ala 


Gly Ala Leu Ala Ser 








DUO 








505 




510 


Arg 


As 21 


ijir 


His Ser 


Axa 


r*l 11 

te^U 


Xjeu 


Pro 


Pro Pro 


Asp Gin Pro Ser Pro 






515 








520 






525 


Cys 


Lys 


Cys 


Thr Cys 


Lys 


Lys 


Glu 


Pro 


Ser Ser 


Val Ser Pro Ser Ala 




530 








535 








540 


Thr 


Val 


Lys 


Thr Arg 


Ser 


Thr 


Lys 


Ala 


Pro Pro 


Pro His Leu Pro Leu 


545 








550 








555 


560 


Ser 


Pro 


Ala 


Leu Thr 


Arg 


Ala 


Val 


Glu 


Gly Val 


Gin Tyr lie Ala Asp 








565 










570 


575 


His 


Leu 


Lys 


Ala Glu 


Asp 


Thr 


Asp 


Phe 


Ser Val 


Lys Glu Asp Trp liys 








580 








565 




590 


Tyr 


Val 


Ala 


Met Val 


He 


Asp 


Arg 


He 


Phe Leu 


Trp Met Phe He He 






595 








600 






605 


Val 


Cys 


Leu 


Leu Gly 


Thr 


Val 


Gly 


Leu 


Phe Leu 


Pro Pro Trp Leu Ala 




610 








615 








620 


Gly 


Met 


He 
















625 





















(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1828 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 
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(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 155... 1561 

(D) OTHER INFORMATION: alphas subunit human neuronal 

nicotinic acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...154 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1562... 1828 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CCCGGCGGGA GCTGTGGCGC GGAGCGGCCC CGCTGCTGCG TCTGCCCTCG TTTTGTCTCA 60 
CGACTCACAC TCAGT6CT6C ATTCCCCAAG AGTTCGCGTT CCCCGC6CGG CGGTCGAGAG 120 
GCGGCTGCCC GCGGTCCCGC GCGGGCGCGG GGCG ATG GCG GCG CGG GGG TCA GGG 175 

Met Ala Ala Arg Gly Ser Gly 
1 5 

CCC CGC GCG CTC CGC CTG CTG CTC TTG GTC CAG CTG GTC GCG GGG CGC 223 
Pro Arg Ala Leu Arg Leu Leu Leu Leu Val Gin Leu Val Ala Gly Arg 
10 15 20 

TGC GGT CTA GCG GGC GCG GCG GGC GGC GCG CAG AGA GGA TTA TCT GAA 271 
Cys Gly Leu Ala Gly Ala Ala Gly Gly Ala Gin Arg Gly Leu Ser Glu 
25 30 35 

CCT TCT TCT ATT GCA AAA CAT GAA GAT AGT TTG CTT AAG GAT TTA TTT 319 
Pro Ser Ser lie Ala Lye His Glu Asp Ser Leu Leu Lys Asp Leu Phe 
40 45 50 55 

C7^ GAC TAC GAA AGA TGG GTT CGT CCT GTG GAA CAC CTG AAT GAC AAA 367 
Gin Asp Tyr Glu Arg Trp Val Arg Pro Val Glu His Leu Asn Asp Lys 
60 65 70 

ATA AAA ATA AAA TTT GGA CTT GCA ATA TCT CAA TTG GTG GAT GTG GAT 415 
lie Lys lie Lys Phe Gly Leu Ala lie Ser Gin Leu Val Asp Val Asp 
75 80 85 

GAG AAA AAT CAG TTA ATG ACA ACA AAC GTC TGG TTG AAA CAG GAA TGG 463 
Glu Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu Trp 
90 95 100 

ATA GAT GTA AAA TTA AGA TGG AAC CCT GAT GAC TAT GGT GGA ATA AAA 511 
lie Asp Val Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly He Lys 
105 110 115 

GTT ATA CGT GTT CCT TCA GAC TCT GTC TGG ACA CCA GAC ATC GTT TTG 559 
Val He Arg Val Pro Ser Asp Ser Val Trp Thr Pro Asp He Val Leu 
120 125 130 135 

TTT GAT AAT GCA GAT GGA CGT TTT GAA GGG ACC AGT ACG AAA ACA GTC 607 
Phe Asp Asn Ala Asp Gly Arg Phe Glu Gly Thr Ser Thr Lys Thr Val 
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140 145 150 

ATC AGG TAG AAT GGC ACT GTC ACC TGG ACT CCA CCG GCA AAC TAC AAA 655 
He Arg Tyr Asn Gly Thr Val Thr Trp Thr Pro Pro Ala Asn Tyr Lys 
155 160 165 

AGT TCC TGT ACC ATA GAT GTC ACG TTT TTC CCA TTT GAC CTT CAG AAC 703 
Ser Ser Cys Thr He Asp Val Thr Phe Phe Pro Phe Asp Leu Gin Asn 
170 175 180 

TGT TCC ATG AAA TTT GGT TCT TGG ACT TAT GAT GGA TCA CAG GTT GAT 751 
Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Ser Gin Val Asp 
185 190 195 

ATA ATT CTA GAG GAC CAA GAT GTA GAC AAG AGA GAT TTT TTT GAT AAT 799 
He lie I^u Glu Asp Gin Asp Val Asp Lys Arg Asp Phe Phe Asp Asn 
200 205 210 215 

GGA GAA TGG GAG ATT GTG AGT GCA ACA GGG AGC AAA GGA AAC AGA ACC 847 
Gly Glu Trp Glu He Val Ser Ala Thr Gly Ser Lys Gly Asn Arg Thr 
220 225 230 

GAC AGC TGT TGC TGG TAT CCG TAT GTC ACT TAC TCA TTT GTA ATC AAG 895 
Asp Ser Cys Cys Trp Tyr Pro Tyr Val Thr Tyr Ser Phe Val He Lys 
235 240 245 

CGC CTG CCT CTC TTT TAT ACC TTG TTC CTT ATA ATA CCC TGT ATT GGG 943 
Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys He Gly 
250 255 ' 260 

CTC TCA TTT TTA ACT GTA CTT GTC TTC TAT CTT CCT TCA AAT GAA GGT 991 
Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asn Glu Gly 
265 270 275 

GAA AAG ATT TGT CTC TGC ACT TCA GTA CTT GTG TCT TTG ACT GTC TTC 1039 
Glu Lys He Cys Leu Cys Thr Ser Val Leu Val Ser Leu Thr Val Phe 
280 285 290 295 

CTT CTG GTT ATT GAA GAG ATC ATA CCA TCA TCT TCA AAA GTC ATA CCT 1087 
Leu Leu Val He Glu Glu He He Pro Ser Ser Ser Lys Val He Pro 
300 305 310 

CTA ATT GGA GAG TAT CTG GTA TTT ACC ATG ATT TTT GTG ACA CTG TCA 1135 
Leu He Gly Glu Tyr Leu Val Phe Thr Met He Phe Val Thr Leu Ser 
315 320 325 

ATT ATG GTA ACC GTC TTC GCT ATC AAC ATT CAT CAT CGT TCT TCC TCA 1183 
He Met Val Thr Val Phe Ala He Asn He His His Arg Ser Ser Ser 
330 335 340 

ACA CAT AAT GCC ATG GCG CCT TTG GTC CGC AAG ATA TTT CTT CAC ACG 1231 
Thr His Asn Ala Met Ala Pro Leu Val Arg Lys He Phe Leu His Thr 
345 350 355 

CTT CCC AAA CTG CTT TGC ATG AGA AGT CAT GTA GAC AGG TAC TTC ACT 1279 
Leu Pro Lys Leu Leu Cys Met Arg Ser His Val Asp Arg Tyr Phe Thr 
360 365 370 375 

CAG AAA GAG GAA ACT GAG AGT GGT AGT GGA CC; lAAA TCT TCT AGA AAC 1327 
Gin Lys Glu Glu Thr Glu Ser Gly Ser Gly Pro » Lys Ser Ser Arg Asn 
380 385 390 



wo 96/41876 



PCTAJS96/09775 



-65- 



ACA TTG GAA GCT GCG CTC AAT TCT ATT CGC TAG ATT ACA AGA CAC ATC 1375 
Thr Leu Glu Ala Ala Leu Asn Ser lie Arg Tyr He Thr Arg His He 
395 400 405 

ATG AAG GAA AAT GAT GTC CGT QAG GTT GTT GAA GAT TGG AAA TTC ATA 1423 
Met Lys Glu Asn Asp Val Arg Glu Val Val Glu Asp Trp Lys Phe He 
410 415 420 

6CC CAG GTT CTT GAT CGG ATG TTT CT6 TGG ACT TTT CTT TTC GTT TCA 1471 
Ala Gin Val Leu Asp Arg Met Phe Leu Trp Thr Phe Leu Phe Val Ser 
425 430 435 

ATT GTT GGA TCT CTT GGG CTT TTT GTT CCT GTT ATT TAT AAA TGG GCA 1519 
He Val Gly Ser Leu Gly Leu Phe Val Pro Val He Tyr Lys Trp Ala 
440 445 450 455 

AAT ATA TTA ATA CCA GTT CAT ATT GGA AAT GCA AAT AAG TGA AGCCTCCCAA 1571 
Asn He Leu He Pro Val His He Gly Asn Ala Asn Lys * 
460 465 

GGGACTGAAG TATACATTTA GTTAACACAC ATATATCTGA TGGCACCTAT AAAATTATGA 1631 

AAATGTAAGT TATGTGTTAA ATTTAGTGCA AGCTTTAACA GACTAAGTTG CTAACCTCAA 1691 

TTTATG TTAA CAGATGATCC ATTTGAACAG TTGGCTGTAT GACTGAAGTA ATAACTGATG 1751 

AGAT ACAT TT QATCTTGTAA AAATAGCAAA ATATTATCTG AACTGGACTA GTGAAAAATC 1811 

TAGTATTTGT ATCCTGG 1828 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 



Met 


Ala 


Ala 


Arg 


Gly 


Ser 


Gly 


Pro 


Arg 


Ala 


Leu 


Arg Leu Leu Leu Leu 


1 








5 










10 




15 


Val 


Gin 


Leu 


Val 


Ala 


Gly 


Arg 


Cys 


Gly 


Leu 


Ala 


Gly Ala Ala Gly Gly 








20 










25 






30 


Ala 


Gin 


Arg 


Gly 


Leu 


Ser 


Glu 


Pro 


Ser 


Ser 


He 


Ala Lys His Glu Asp 






35 










40 








45 


Ser 


Leu 


Leu 


Lys 


Asp 


Leu 


Phe 


Gin 


Asp 


Tyr 


Glu 


Arg Trp Val Arg Pro 




SO 










55 










60 


Val 


Glu 


His 


Leu 


Asn 


Asp 


Lys 


He 


Lys 


He 


Lys 


Phe Gly Leu Ala He 


65 










70 










75 


80 


Ser 


Gin 


Leu 


Val 


Asp 


Val 


Asp 


Glu 


Lys 


Asn 


Gin 


Leu Met Thr Thr Asn 










85 










90 




95 


Val 


Trp 


Leu 


Lys 


Gin 


Glu 


Trp 


He 


Asp 


Val 


Lys 


Leu Arg Trp Asn Pro 








100 










105 






110 


Asp 


Asp 


Tyr 


Gly 


Gly 


He 


Lys 


Val 


He 


Arg 


Val 


Pro Ser Asp Ser Val 






115 










12 0 








125 


Trp 


Thr 


Pro 


Asp 


He 


Val 


Leu 


Phe 


Asp 


Asn 


Ala 


Asp Gly Arg Phe Glu 



130 135 140 
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Gly Thr Ser Thr Lys Thr Val lie Arg Tyr Asn Gly Thr Val Thr Trp 
145 150 155 160 

Thr Pro Pro Ala Asn Tyr Lys Ser Ser Cys Thr lie Asp Val Thr Phe 

165 170 175 

Phe Pro Phe Asp Leu Gin Asn Cys Ser Met Lys Phe Gly Ser Trp Thr 

180 185 190 

Tyr Asp Gly Ser Gin Val Asp lie lie lieu Glu Asp Gin Asp Val Asp. 

195 200 205 

Lys Arg Asp Phe Phe Asp Asn Gly Glu Trp Glu lie Val Ser Ala Thr 

210 215 220 

Gly Ser Lys Gly Asn Arg Thr Asp Ser Cys Cys Trp Tyr Pro Tyr Val 
225 230 235 240 

Thr Tyr Ser Phe Val He Lys Arg Leu Pro Leu Phe Tyr Thr Leu Phe 

245 250 255 

Leu He He Pro Cys He Gly Leu Ser Phe Leu Thr Val Leu Val Phe 

260 265 270 

Tyr Leu Pro Ser Asn Glu Gly Glu Lys He Cys Leu Cys Thr Ser Val 

275 280 285 

Leu Val Ser Leu Thr Val Phe Leu Leu Val He Glu Glu He He Pro 

290 295 300 

Ser Ser Ser Lys Val He Pro Leu He Gly Glu Tyr Leu Val Phe Thr 
305 310 315 320 

Met He Phe Val Thr Leu Ser He Met Val Thr Val Phe Ala He Asn 

325 330 335 

He His His Arg Ser Ser Ser Thr His Asn Ala Met Ala Pro Leu Val 

340 345 350 

Arg Lys He Phe Leu His Thr Leu Pro Lys Leu Leu Cys Met Arg Ser 

355 360 365 

His Val Asp Arg Tyr Phe Thr Gin Lys Glu Glu Thr Glu Ser Gly Ser 

370 375 380 

Gly Pro Lys Ser Ser Arg Asn Thr Leu Glu Ala Ala Leu Asn Ser He 
385 390 395 400 

Arg Tyr He Thr Arg His He Met Lys Glu Asn Asp Val Arg Glu Val 

405 410 415 

Val Glu Asp Trp Lys Phe He Ala Gin Val Leu Asp Arg Met Phe Leu 

420 425 430 

Trp Thr Phe Leu Phe Val Ser He Val Gly Ser Leu Gly Leu Phe Val 

435 440 445 

Pro Val He Tyr Lys Trp Ala Asn He Leu He Pro Val His He Gly 

450 455 460 

Asn Ala Asn Lys 
465 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1743 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence j 

(B) LOCATION: 143... 1627 • 

(D) OTHER INFORMATION: alpha6 subunit human neuronal 
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nicotinic acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...142 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'XJTR 

(B) LOCATION: 1628... 1743 
(D) OTHER INFORMATION: 



(xi) SEQXJENCE DESCRIPTION: SEQ ID NO: 9: 

CGGGTTTTGA TTTCTGAGAA GACACACACG GATTGCAGTG GGCTTCTGAT GATGTCAAGG 60 
TTGGATGCAT GTGGCTGACT GATAGCTCTT TGTTTTCCAC AATCCTTTGC CTAGGAAAAA 120 
GGAATCCAAG TGTGTTTTAA CC ATG CTG ACC AGC AAG GGG CAG GGA TTC CTT 172 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu 
15 10 

CAT GGG GGC TTG TGT CTC TGG CTG TGT GTQ TTC ACA CCT TTC TTT AAA 220 
His Gly Gly Leu Cys Leu Trp Leu Cys Val Phe Thr Pro Phe Phe Lys 
15 20 25 

GGC TGT GTG GGC TGT GCA ACT GAG GAG AGG CTC TTC CAC AAA CTG TTT 268 
Gly Cys Val Gly Cys Ala Thr Glu Glu Arg Leu Phe His Lys Leu Phe 
30 35 40 

TCT CAT TAC AAC CAG TTC ATC AGG CCT GTG GAA AAC GTT TCC GAC CCT 316 
Ser His Tyr Asn Gin Phe lie Arg Pro Val Glu Asn Val Ser Asp Pro 
45 50 55 

GTC ACG GTA CAC TTT GAA GTG GCC ATC ACC CAG CTG GCC AAC GTG GAT 364 
Val Thr Val His Phe Glu Val Ala He Thr Gin Leu Ala Asn Val Asp 
60 65 70 

GAA GTA AAC CAG ATC ATG GAA ACC AAT TTG TGG CTG CGT CAC ATC TGG 412 
Glu Val Asn Gin He Met Glu Thr Asn Leu Trp Leu Arg His He Trp 
75 80 85 90 

AAT GAT TAT AAA TTG CGC TGG GAT CCA ATG GAA TAT GAT GGC ATT GAG 460 
Asn Asp Tyr Lys Leu Arg Trp Asp Pro Met Glu Tyr Asp Gly He Glu 
95 100 105 

ACT CTT CGC GTT CCT GCA GAT AAG ATT TGG AAG CCC GAC ATT GTT CTC 508 
Thr Leu Arg Val Pro Ala Asp Lys He Trp Lys Pro Asp He Val Leu 
110 115 120 

TAT AAC AAT GCT GTT GGT GAC TTC CAA GTA GAA GGC AAA ACA AAA GCT 556 
Tyr Asn Asn Ala Val Gly Asp Phe Gin Val Glu Gly Lys Thr Lys Ala 
125 130 135 

CTT CTT AAA TAC AAT GGC ATG ATA ACC TGG ACT CCA CCA GCT ATT TTT 604 
Leu Leu Lys Tyr Asn Gly Met He Thr Trp Thr Pro Pro Ala He Phe 
140 145 150 

AAG AGT TCC TGC CCT ATG GAT ATC ACC TTT TTC CCT TTT GAT CAT CAA 652 
Lys Ser Ser Cys Pro Met Asp He Thr Phe Phe Pro Phe Asp His Gin 
155 160 165 170 

AAC TGT TCC CTA AAA TTT GGT TCC TGG ACG TAT GAC AAA GCT GAA ATT 700 
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Asn Cys Ser Leu Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Qlu lie 
175 180 185 

GAT CTT CTA ATC ATT GGA TCA AAA GTG GAT ATG AAT GAT TTT TGG GAA 748 
Asp Leu Leu He He Gly Ser Lys Val Asp Met Asn Asp Phe Trp Glu 
190 195 200 

AAC AGT GAA TGG GAA ATC ATT GAT GCC TCT GGC TAC AAA CAT GAC ATC 796 
Asn Ser Glu Trp Glu He He Asp Ala Ser Gly Tyr Lys His Asp He 
205 210 215 

AAA TAC AAC TGT TOT GAA GAG ATA TAC ACA GAT ATA ACC TAT TCT TTC 844 
Lys Tyr Asn Cys Cys Glu Glu He Tyr Thr Asp He Thr Tyr Ser Phe 
220 225 230 

TAC ATT AGA AGA TTG CCG ATG TTT TAC ACG ATT AAT CTG ATC ATC CCT 892 
Tyr He Arg Arg Leu Pro Met Phe Tyr Thr He Asn Leu He He Pro 
235 240 245 250 

TGT CTC TTT ATT TCA TTT CTA ACC GTG TTG GTC TTT TAC CTT CCT TCG 940 
Cys Leu Phe He Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser 
255 260 265 

GAC TGT GGT GAA AAA GTG ACG CTT TGT ATT TCA GTC CTG CTT TCT CTG 988 
Asp Cys Gly Glu Lys Val Thr Leu Cys He Ser Val Leu Leu Ser Leu 
270 275 280 

ACT GTG TTT TTG CTG GTC ATC ACA GAA ACC ATC CCA TCC ACA TCT CTG 1036 
Thr Val Phe Leu Leu Val He Thr Glu Thr He Pro Ser Thr Ser Leu 
285 290 295 

GTG GTC CCA CTG GTG GGT GAG TAC CTG CTG TTC ACC ATG ATC TTT GTC 1084 
Val Val Pro Leu Val Gly Glu Tyr Leu Leu Phe Thr Met He Phe Val 
300 305 310 

ACA CTG TCC ATC GTG GTG ACT GTG TTT GTG TTG AAC ATA CAC TAC CGC 1132 
Thr Leu Ser He Val Val Thr Val Phe Val Leu Asn He His Tyr Arg 
315 320 325 330 

ACC CCA ACC ACG CAC ACA ATG CCC AGG TGG GTG AAG ACA GTT TTC CTG 1180 
Thr Pro Thr Thr His Thr Met Pro Arg Trp Val Lys Thr Val Phe Leu 
335 340 345 

AAG CTG CTG CCC CAG GTC CTG CTG ATG AGG TGG CCT CTG GAC AAG ACA 1228 
Lys Leu Leu Pro Gin Val Leu Leu Met Arg Trp Pro Leu Asp Lys Thr 
350 355 360 

AGG GGC ACA GGC TCT GAT GCA GTG CCC AGA GGC CTT GCC AGG AGG CCT 1276 
Arg Gly Thr Gly Ser Asp Ala Val Pro Arg Gly Leu Ala Arg Arg Pro 
365 370 375 

GCC AAA GGC AAG CTT GCA AGC CAT GGG GAA CCC AGA CAT CTT AAA GAA 1324 
Ala Lys Gly Lys Leu Ala Ser His Gly Glu Pro Arg His Leu Lys Glu 
380 385 390 

TGC TTC CAT TGT CAC AAA TCA AAT GAG CTT GCC ACA AGC AAG AGA AGA 1372 
Cys Phe His Cys His Lys Ser Asn Glu Leu Ala Thr Ser Lys Arg TUrg 
395 400 405 410 

TTA AGT CAT CAG CCA TTA CAG TGG GTG GTG GAA AAT TCG GAG CAC TCG 1420 
Leu Ser His Gin Pro Leu Gin Trp Val Val Glu Asn Ser Glu His Ser 
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415 420 425 

CCT GAA GTT GAA GAT GTG ATT AAC AGT GTT CAG TTC ATA GCA GAA AAC 1468 
Pro Glu Val Glu Asp Val He Asn Ser Val Gin Phe He Ala Glu Asn 
430 435 440 

ATG AAG AGC CAC AAT GAA ACC AAG GAG GTA GAA GAT GAG TGG AAA TAG 1516 
Met Lys Ser His Asn Glu Thr Lys Glu Val Glu Asp Asp Trp Lys Tyr 
445 450 455 

GTG GCC ATG GTG GTG GAG AGA GTA TTT GTT TGG GTA TTT ATA ATT GTC 1564 
Val Ala Met Val Val Asp Arg Val Phe Leu Trp Val Phe He lie Val 
460 465 470 

TGT GTA TTT GGA ACT GCA GGG CTA TTT CTA CAG CCA CTA CTT GGG AAC 1612 
Cys Val Phe Gly Thr Ala Gly Leu Phe Leu Gin Pro Leu Leu Qly Asn 
475 480 485 490 

ACA GGA AAA TCT TAA AATGTATTTT CTTTTATGTT CAGAAATTTA CAGACACCAT AT 1669 
Thr Gly Lys Ser * 
495 

TTGTTCTGCA TTCCCTGCCA CAAGGAAAGG AAAGCAAAGG CTTCCCACCC AAGTCCCCCA 1729 
TCTGCTAAT^ CCCG 1743 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



Met 


Leu 


Thr 


Ser 


Lys 


Gly 


Gin 


Gly 


Phe 


Leu 


His 


Gly Gly Leu Cys Leu 


1 








5 










10 




15 


Trp 


Leu 


Cys 


Val 


Phe 


Thr 


Pro 


Phe 


Phe 


Lys 


Gly 


Cys Val Gly Cys Ala 








20 










25 






30 


Thr 


Glu 


Glu 


Arg 


Leu 


Phe 


His 


Lys 


Leu 


Phe 


Ser 


His Tyr Asn Gin Phe 






35 










40 








45 


He 


Arg 


Pro 


Val 


Glu 


Asn 


Val 


Ser 


Asp 


Pro 


Val 


Thr Val His Phe Glu 




50 










55 








60 


Val 


Ala 


He 


Thr 


Gin 


Leu 


Ala 


Asn 


Val 


Asp 


Glu 


Val Asn Gin He Met 


65 










70 








75 


80 


Glu 


Thr 


Asn 


Leu 


Trp 


Leu 


Arg 


His 


He 


Trp 


Asn 


Asp Tyr Lys Leu Arg 










85 










90 




95 


Trp 


Asp 


Pro 


Met 


Glu 


Tyr 


Asp 


Gly 


He 


Glu 


Thr 


Leu Arg Val Pro Ala 








100 










105 






110 


Asp 


Lys 


He 


Trp 


Lys 


Pro 


Asp 


He 


Val 


Leu 


Tyr 


Asn Asn Ala Val Gly 






lis 










120 








125 


Asp 


Phe 


Gin 


Val 


Glu 


Gly 


Lys 


Thr 


Lys 


Ala 


Leu 


Leu Lys Tyr Asn Gly 




130 










135 










140 


Met 


He 


Thr 


Trp 


Thr 


Pro 


Pro 


Ala 


He 


Phe 


Lys 


Ser Ser Cys Pro Met 


145 










150 










155 


160 
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Asp 


lie 


Thr Phe 


Phe Pro Phe Asp His Gin 


Asn 


Cvs Ser Leu Lvs 


Phe 








165 


170 




X /D 




Glv 


Ser 


Trp Thr 


Tyr Asp Lys Ala Glu lie 




xjcu ±jcu xxc xxcs 


Gly 






180 




185 




X7U 


Sgst 


Lys 


Val Asp 


Met Asn Asp Phe Trp Glu 


Hen 


OCX \3XU AX^ uxU 


Tl a 






1Q5 




200 




one 




lie 


Asp 


Ala Ser 


Gly Tyr Lys His Asp He 


Lys 


xyx ash ^ys (jys 


\9XU 




210 






215 








Qlu 


lie 


Tyr Thr 


Asp He Thr Tyr Ser Phe 


lyr 


TT a HT*rt T.aii 
xxc nx^ aX%|^ ucU 


Pro 










230 






o An 


Met 


Phe 


Tyr Thr 


Xle 


Asn Leu He He Pro 


Cys 


Tt0ii Tl a Car* 

XJCU XrllC xxc OCX 


Dlia 

IrXlC 








245 


250 






lieu 


Thr 


Val Leu 


Val 


Phe Tyr Leu Pro Ser 


Asp 


^jro wXjr wXU ^ijfo 


Val 
vdx 










265 




^ /u 






Leu 


Cys lie 


Ser 


Val Leu Leu Ser Leu 


Thr 


Val OVia T>aii T.aii 

V a X JrXXC AiC U JJICU 


Val 
vclx 










280 








Tie 




Gin Thr 


lie 


Pro Ser Thr Ser Leu 


VAX 


Val Dv/> T.aii Val 
VaX irxO iJcU Val 


nl 
wiy 




o on 






295 




rt A 




Tyr 


Leu Leu 


Phe 


Thr Met He Phe Val 


inr 


Leu ber ne vax 


xr-»l 

vai 










310 


31b 




320 


IXU: 


VAX 


xrxie vau. 


Leu 


Asn He His Tyr Arg 


Thr 


Pro Thr Thr His 


Txir 








325 


330 




335 




Mai- 


Pro 


Arg Trp 


Val 


Lys Thr Val Phe Leu 


Lys 


Leu Leu Pro Gin 


vaJL 






340 




345 




350 




Leu 


Leu 


neu Arg 


Trp 


Pro Leu Asp Lys Thr 


Arg 


uxy xnr Gxy ser 


ASp 






^ e e 
355 




360 




365 






V ctx 




Gly Leu Ala Arg Arg Pro 




Xtjro wXy Xijro XiCU 


2Vl a 




370 






375 




380 




Ser 


His 


Gly Glu 


Pro Arg His Leu Lys Glu 


Cys 


Phe His Cys His 


Lys 


385 








390 


395 




400 


Ser 


Asn 


Glu Leu 


Ala 


Thr Ser Lys Arg Arg 


Leu 


Ser His Gin Pro 


Leu 








405 


410 




415 




Gin 


Trp 


Val Val 


Glu 


Asn Ser Glu His Ser 


Pro 


Glu Val Glu Asp 


Val 






420 




425 




430 




lie 


Asn 


Ser Val 


Gin 


Phe He Ala Glu Asn 


Met 


Lys Ser His Asn 


Glu 






435 




440 




445 




Thr 


Lys 


Glu Val 


Glu Asp Asp Trp Lys Tyr 


Val 


Ala Met Val Val 


Asp 




450 






455 




460 




Arg 


val 


Phe Leu 


Trp Val Phe He He Val 


Cys 


Val Phe Gly Thr 


Ala 


465 








470 


475 




480 


Gly 


Leu 


Phe Leu 


Gin Pro Leu Leu Gly Asn 


Thr 


Gly Lys Ser 





485 490 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



(A) NAME /KEY: Coding Sequence 

(B) LOCATION: 73.., 1581 

(D) OTHER INFORMATION: alpha? human neuronal nicotinic 
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acetyl choline receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...72 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1582... 1876 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCCGCAGGC GCAGGCCCGG GCGACAGCCG AGACGTGGAG CGCGCCGGCT CGCTGCAGCT 60 
CCGGGACTCA AC ATG CGC TGC TCG CCG GGA GGC GTC TGG CTG GCG CTG GCC 111 
Met Arg Cys Ser Pro Gly Gly Val Trp Leu Ala Leu Ala 
15 10 

GCG TCG CTC CTG CAC GTG TCC CTG CAA GGC GAG TTC CAG AGG AAG CTT 159 
Ala Ser Leu Leu His Val Ser Leu Gin Gly Glu Phe Gin Arg Lys Leu 
15 20 25 

TAC AAG GAG CTG GTC AAG AAC TAC AAT CCC TTG GAG AGG CCC GTG GCC 207 
Tyr Lys Glu Leu Val Lys Asn Tyr Asn Pro Leu Glu Arg Pro Val Ala 
30 35 40 45 

AAT GAC TCG CAA CCA CTC ACC GTC TAC TTC TCC CTG AGC CTC CTG CAG 255 
Asn Asp Ser Gin Pro Leu Thr Val Tyr Phe Ser Leu Ser Leu Leu Gin 
50 55 60 

ATC ATG GAC GTG GAT GAG AAG AAC CAA GTT TTA ACC ACC AAC ATT TGG 303 
He Met Asp Val Asp Glu Lys Asn Gin Val Leu Thr Thr Asn He Trp 
65 70 75 

CTG CAA ATG TCT TGG ACA GAT CAC TAT TTA CAG TGG AAT GTG TCA GAA 351 
Leu Gin Met Ser Trp Thr Asp His Tyr Leu Gin Trp Asn Val Ser Glu 
80 85 90 

TAT CCA GGG GTG AAG ACT GTT CGT TTC CCA GAT GGC CAG ATT TGG AAA 399 
Tyr Pro Gly Val Lys Thr Val Arg Phe Pro Asp Gly Gin He Trp Lys 
95 100 105 

CCA GAC ATT CTT CTC TAT AAC AGT GCT GAT GAG CGC TTT GAC GCC ACA 447 
Pro Asp He Leu Leu Tyr Asn Ser Ala Asp Glu Arg Phe Asp Ala Thr 
110 115 120 125 

TTC CAC ACT AAC GTG TTG GTG AAT TCT TCT GGG CAT TGC CAG TAC CTG 495 
Phe His Thr Asn Val Leu Val Asn Ser Ser Gly His Cys Gin Tyr Leu 
130 135 140 

CCT CCA GGC ATA TTC AAG AGT TCC TGC TAC ATC GAT GTA CGC TGG TTT 543 
Pro Pro Gly He Phe Lys Ser Ser Cys Tyr He Asp Val Arg Trp Phe 
145 150 155 

CCC TTT GAT GTG CAG CAC TGC AAA CTG AAG TTT GGG TCC TGG TCT TAC 591 
Pro Phe Asp Val Gin His Cys Lys Leu Lys Phe Gly Ser Trp Ser Tyr 
160 165 170 

GGA GGC TGG TCC TTG GAT CTG CAG ATG CAG GAG GCA GAT ATC AGT GGC 639 
Gly Gly Trp Ser Leu Asp Leu Gin Met Gin Glu Ala Asp He Ser Gly 
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175 180 185 

TAT ATC CCC AAT GGA GAA TGG GAC CTA GTG GGA ATC CCC GGC AAG AGG 687 
Tyr lie Pro Asn Gly Glu Trp Asp Leu Val Gly He Pro Gly Lys Arg 
190 195 200 205 

AGT GAA AGG TTC TAT GAG TGC TGC AAA GAG CCC TAC CCC GAT GTC ACC 735 
Ser Glu Arg Phe Tyr Glu Cys Cys Lys Glu Pro Tyr Pro Asp Val Thr 
210 215 220 

TTC ACA GTG ACC ATG CGC CGC AGG ACG CTC TAC TAT GGC CTC AAC CTG 783 
Phe Thr Val Thr Met Arg Arg Arg Thr Leu Tyr Tyr Gly Leu Asn Leu 
225 230 235 

CTG ATC CCC TGT GTG CTC ATC TCC GCC CTC GCC CTG CTG GTG TTC CTG 831 
Leu He Pro Cys Val Leu He Ser Ala Leu Ala Leu Leu Val Phe Leu 
240 245 250 

CTT CCT GCA GAT TCC GGG GAG AAG ATT TCC CTG GGG ATA ACA GTC TTA 879 
Leu Pro Ala Asp Ser Gly Glu Lys He Ser Leu Gly He Thr Val Leu 
255 260 265 

CTC TCT CTT ACC GTC TTC ATG CTG CTC CTG GCT GAG ATC ATG CCC GCA 927 
Leu Ser Leu Thr Val Phe Met Leu Leu Val Ala Glu He Met Pro Ala 
270 275 280 285 

ACA TCC GAT TCG GTA CCA TTG ATA GCC CAG TAC TTC GCC AGC ACC ATG 975 
Thr Ser Asp Ser Val Pro Leu He Ala Gin Tyr Phe Ala Ser Thr Met 
290 295 300 

ATC ATC GTG GGC CTC TCG GTG GTG GTG ACG GTG ATC GTG CTG CAG TAC 1023 
He He Val Gly Leu Ser Val Val Val Thr Val He Val Leu Gin Tyr 
305 3ia 315 

CAC CAC CAC GAC CCC GAC GGG GGC AAG ATG CCC AAG TGG ACC AGA GTC 1071 
His His His Asp Pro Asp Gly Gly Lys Met Pro Lys Trp Thr Arg Val 
320 325 330 

ATC CTT CTG AAC TGG TGC GCG TGG TTC CTG CGA ATG AAG AGG CCC GGG 1119 
He Leu Leu Asn Trp Cys Ala Trp Phe Leu Arg Met Lys Arg Pro Gly 
335 340 345 

GAG GAC AAG GTG CGC CCG GCC TGC CAG CAC AAG CAG CGG CGC TGC AGC 1167 
Glu Asp Lys Val Arg Pro Ala Cys Gin His Lys Gin Arg Arg Cys Ser 
350 355 360 365 

CTG GCC AGT GTG GAG ATG AGC GCC GTG GCG CCG CCG CCC GCC AGC AAC 1215 
Leu Ala Ser Val Glu Met Ser Ala Val Ala Pro Pro Pro Ala Ser Asn 
370 375 380 

GGG AAC CTG CTG TAC ATC GGC TTC CGC GGC CTG GAC GGC GTG CAC TGT 1263 
Glv Asn Leu Leu Tyr He Gly Phe Arg Gly Leu Asp Gly Val His Cys 
385 390 395 

GTC CCG ACC CCC GAC TCT GGG GTA GTG TGT GGC CGC ATG GCC TGC TCC 1311 
Val Pro Thr Pro Asp Ser Gly Val Val Cys Gly Arg Met Ala Cys Ser 
400 405 410 

CCC ACG CAC GAT GAG CAC CTC CTG CAC GGC GGG CAA CCC CCC GAG GGG 1359 
Pro Thr His Asp Glu His Leu Leu His Gly Gly Gin Pro Pro Glu Gly 
415 420 425 
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GAC CCG GAC TTG GCC AAG ATC CTG GAG GAG GTC CGC TAG ATT GCC AAT 1407 
Asp Pro Asp Leu Ala Lys lie Leu Glu Glu Val Arg Tyr lie Ala Asn 
430 435 440 445 

CGC TTC CGC TGC CAG GAC GAA AGC GAG GCG GTC TGC AGC GAG TGQ AAG 1455 
Arg Phe Arg Cys Gin Asp Glu Ser Glu Ala Val Cys Ser Glu Trp Lys 
450 455 460 

TTC GCC GCC TGT GTG GTG GAC CGC CTG TGC CTC ATG GCC TTC TCG GTC 1503 
Phe Ala Ala Cys Val Val Asp Arg Leu Cys Leu Met Ala Phe Ser Val 
465 470 475 

TTC ACC ATC ATC TGC ACC ATC GGC ATC CTG ATG TCG GCT CCC AAC TTC 1551 
Phe Thr lie He Cys Thr He Gly He Leu Met Ser Ala Pro Asn Phe 
480 485 490 

GTG GAG GCC GTG TCC AAA GAC TTT GCG TAA CCACGCCTGG TTCTGTACAT GTGG 1605 
Val Glu Ala Val Ser Lys Asp Phe Ala * 
495 500 

AAAACTCACA GAT66GCAAG GCCTTTGGCT TGGCGAGATT TGGGGGTGCT AATCCAGGAC 1665 

AGCATTACAC GCCACAACTC CAGTGTTCCC TTCTGGCTGT CAGTCGTGTT GCTTACG6TT 1725 

TCTTTGTTAC TTTAGGTAGT AGAATCTCAG CACTTTGTTT CATATTCTCA GATGGGCTGA 1785 

TAGATATCCT TGGCACATCC GTACCATCGG TCAGCAGGGC CACTGAQTAG TCATTTTGCC 1845 

CATTAGCCCA CTGCCTGGAA AGCCCTTCGG A 1876 

(2) INFORMATION FOR SSQ ID NO: 12: 

(i) SEQT3ENCE CHARACTERISTICS: 

(A) LENGTH: 446 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

<v) FRAGMENT TYPE: N- terminal 
(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



Met 


Arg 


Cys 


Ser 


Pro 


Gly 


Gly 


Val 


Trp 


Ala 


Ala 


Ala Ser His Val Ser 


1 








5 










10 




15 


Gin 


Gly 


Glu 


Phe 


Gin 


Arg 


Lys 


Tyr 


Lys 


Glu 


Val 


Lys Asn Tyr Asn Pro 








20 










25 






30 


Glu 


Arg 


Pro 


Val 


Ala 


Asn 


Asp 


Ser 


Gin 


Pro 


Thr 


Val Tyr Phe Ser Ser 






35 










40 








45 


Gin 


He 


Met 


Asp 


Val 


Asp 


Glu 


Lys 


Asn 


Gin 


Val 


Thr Thr Asn He Trp 




50 










55 










60 


Gin 


Met 


Ser 


Trp 


Thr 


Asp 


His 


Tyr 


Gin 


Trp 


Asn 


Val Ser Glu Tyr Pro 


65 










70 










75 


80 


Gly 


Val 


Lys 


Thr 


Val 


Arg 


Phe 


Pro 


Asp 


Gly 


Gin 


He Trp Lys Pro Asp 










85 










90 




95 


He 


Tyr 


Asn 


Ser 


Ala 


Asp 


Glu 


Arg 


Phe 


Asp 


Ala 


Thr Phe His Thr Asn 








100 










105 






110 


Val 


Val 


Asn 


Ser 


Ser 


Gly 


His 


Cys 


Gin 


Tyr 


Pro 


Pro Gly He Phe Lys 






115 










120 








125 


Ser 


Ser 


Cys 


Tyr 


He 


Asp 


Val 


Arg 


Trp 


Phe 


Pro 


Phe Asp Val Gin His 




130 










135 










140 


Cys 


Lys 


Lys 


Phe 


Gly 


Ser 


Trp 


Ser 


Tyr 


Gly 


Gly 


Trp Ser Asp Gin Met 
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145 


150 










155 




160 


Gin Glu Ala 


Asp lie Ser 


Gly 


Tyr 


He 


Pro 


Asn 


Gly Glu Trp Asp 


Val 




165 








170 




175 




Oly lie Pro 


Gly Lys Arg 


Ser 


Glu 


Arg 


Phe 


Tvr 


Glu Cys Cys Lys 


Glu 




180 






185 






190 




Pro Tyr Pro 


Asp Val Thr 


Phe 


Thr 


Val 


Thr 


Met 


Arg Arg Arg Thr 


Tvi" 


195 






200 








205 


Tyr Gly Asn 


lie Pro Cys 


Val 


He 


Ser 


Ala 


Ala 


Val Phe Pro Ala 

VCiX IrXiC? IrXU 


Asp 


210 




215 










220 


Ser Gly Glu 


Lys lie Ser 


Gly 


lie 


Thr 


Val 


Ser 


Thr Val Phe Met 


Val 


225 


230 










235 






Ala Glu lie 


Met Pro Ala 


Thr 


Ser 


Asp 


Ser 


Val 


Pro He Ala Gin 


Tyr 




245 








250 




9 9 


Phe Ala Ser 


Thr Met lie 


He 


Val 


Glv 


Ser 


Val 


Val Val Thr Val 


He 




260 






265 






270 




Val Gin Tyr 


His His His 


Asp 


Pro 


Asp 


Glv 


Glv 


Lys Met Pro Lys 




275 






280 








285 


Thr Arg Val 


lie Asn Trp 


Cys 


Ala 


Trp 


Phe 


Ara 


Met Lys Arg Pro 


Glv 


290 




295 










300 


Glu Asp Lys 


Val Arg Pro 


Ala 


Cvs 


Gin 


His 


Lys 


Gin Ato Apo Cvs 
Vwj^o 


Ser 


305 


310 










315 




Ala Ser Val 


Glu Met Ser 


Ala 


Val 


Ala 


Pro 


Pro 


Ovn Zkl A QoT" lian 

K^AVJ MJLd OB^ nou 






325 








330 






Asn Tyr lie 


Gly Phe Arg 


Gly 


Asp 


Gly 


Val 


His 


Cys Val Pro Thr 


Pro 




340 






345 






350 




Asp Ser Gly 


Val Val Cys 


Gly 


Arg 


Met 


Ala 


Cys 


Ser Pro Thr His 


Asp 


355 






360 








365 


Glu His His 


Gly Gly Gin 


Pro 


Pro 


Glu 


Gly 


Asp 


Pro Asp Ala Lys 


He 


370 




375 










380 




Glu Glu Val 


Arg Tyr lie 


Ala 


Asn 


Arg 


Phe 


Arg 


Cys Gin Asp Glu 


Ser 


385 


390 










395 




400 


Glu Ala Val 


Cys Ser Glu 


Trp 


Lys 


Phe 


Ala 


Ala 


Cys Val Val Asp 


Arg 




405 








410 




415 




Cys Met Ala 


Phe Ser Val 


Phe 


Thr 


He 


He 


Cys 


Thr He Gly He 


Met 




420 






425 






430 




Ser Ala Pro 


Asn Phe Val 


Glu 


Ala 


Val 


Ser 


Lys 


Asp Phe Ala 




435 






440 






445 





(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2448 base pairs 

(B) TVPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 265... 1773 

(O) OTHER INFORMATION: beta2 human neuronal nicotinic 

acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...264 
(D) OTHER INFORMATION: 
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(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1774... 2448 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CTCCTCCCCC TCACCGTCCC AATTGTATTC CCTGG7VAGAG CAGCCGGAAA A6CCTCCGCC 60 
TGCTCATACC AGGATAGGCA AGAAGCTGGT TTCTCCTCGC AGCCGGCTCC CTGAGGCCCA 120 
GGAACCACCG CGGCGGCCGG CACCACCTGG ACCCAGCTCC AGGCGGGCGC GGCTTCAGCA 180 
CCACGGACAG CGCCCCACCC GCGGCCCTCC CCCCGGC6GC GCGCTCCAGC CG6TGTAGGC 240 
GAGGCAGCGA GCTATGCCCG CGGC ATG GCC CGG CGC TGC GGC CCC GTG GCG 291 

Met Ala Arg Arg Cys Gly Pro Val Ala 
1 5 

CTG CTC CTT GGC TTC GGC CTC CTC CGG CTG TGC TCA GGG GTG TGG GGT 339 
Leu Leu Leu Gly Phe Gly Leu Leu Arg Leu Cys Ser Gly Val Trp Gly 
10 15 20 25 

ACG GAT ACA GAG GAG CGG CTG GTG GAG CAT CTC CTG GAT CCT TCC CGC 387 
Thr Asp Thr Glu Glu Arg Leu Val Glu His Leu Leu Asp Pro Ser Arg 
30 35 40 

TAC AAC AAG CTT ATC CGC CCA GCC ACC AAT GGC TCT GAG CTG GTG ACA 435 
Tyr Asn Lys Leu lie Arg Pro Ala Thr Asn Gly Ser Glu Leu Val Thr 
45 50 55 

GTA CAG CTT ATG GTG TCA CTG GCC CAG CTC ATC AGT GTG CAT GAG CGG 483 
Val Gin Leu Met Val Ser Leu Ala Gin Leu lie Ser Val His Glu Arg 
60 65 70 

GAG CAG ATC ATG ACC ACC AAT GTC TGG CTG ACC CAG GAG TGG GAA GAT 531 
Glu Gin lie Met Thr Thr Asn Val Trp Leu Thr Gin Glu Trp Glu Asp 
75 80 85 

TAT CGC CTC ACC TGG AAG CCT GAA GAG TTT GAC AAC ATG AAG AAA GTT 579 
Tyr Arg Leu Thr Trp Lys Pro Glu Glu Phe Asp Asn Met Lys Lys Val 
90 95 100 105 

CGG CTC CCT TCC AAA CAC ATC TGG CTC CCA GAT GTG GTC CTG TAC AAC 627 
Arg Leu Pro Ser Lys His lie Trp Leu Pro Asp Val Val Leu Tyr Asn 
110 115 120 

AAT GCT GAC GGC ATG TAC GAG GTG TCC TTC TAT TCC AAT GCC GTG GTC 675 
Asn Ala Asp Gly Met Tyr Glu Val Ser Phe Tyr Ser Asn Ala Val Val 
125 130 135 



TCC TAT GAT GGC AGC ATC TTC TGG CTG CCG CCT GCC ATC TAC AAG AGC 723 
Ser Tyr Asp Gly Ser He Phe Trp Leu Pro Pro Ala He Tyr Lys Ser 
140 145 ' 150 

GCA TGC AAG ATT GAA GTA AAG CAC TTC CCA TTT GAC CAG CAG AAC TGC 771 
Ala Cys Lys He Glu Val Lys His Phe Pro Phe Asp Gin Gin Asn Cys 
155 160 165 

ACC ATG AAG TTC CGT TCG TGG ACC TAC GAC CGC ACA GAG ATC GAC TTG 819 
Thr Met Lys Phe Arg Ser Trp Thr Tyr Asp Arg Thr Glu He Asp Leu 
170 175 180 185 
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GTG CTG AAG AGT GAG GTG GCC AGC CTG GAC GAC TTC ACA CCT AGT GGT 867 
Val Leu Lys Ser Glu Val Ala Ser Leu Asp Asp Phe Thr Pro Ser Gly 
190 195 200 

GAG TGG GAC ATC GTG GCG CTG CCG GGC CGG CGC AAC GAG AAC CCC GAC 915 
Glu Trp Asp He Val Ala Leu Pro Gly Arg Arg Asn Glu Asn Pro Asp 
205 210 215 

GAC TCT ACG TAC GTG GAC ATC ACG TAT GAC TTC ATC ATT CGC CGC AAG 963 
Asp Ser Thr Tyr Val Asp He Thr Tyr Asp Phe He He Arg Arg Lys 
220 225 230 

CCG CrC TTC TAC ACC ATC AAC CTC ATC ATC CCC TGT GTG CTC ATC ACC 1011 
Pro Leu Phe Tyr Thr He Asn Leu He He Pro Cys Val Leu He Thr 
235 240 245 

TCG CTA GCC ATC CTT GTC TTC TAC CTG CCA TCC GAC TGT GGC GAG AAG 1059 
Ser Leu Ala He Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys 
250 255 260 265 

ATG ACG TTG TGC ATC TCA GTG CTG CTG GCG CTC ACG GTC TTC CTG CTG 1107 
Met Thr Leu Cys He Ser Val Leu Leu Ala Leu Thr Val Phe Leu Leu 
270 275 280 

CTC ATC TCC AAG ATC QTG CCT CCC ACC TCC CTC GAC GTG CCG CTC GTC 1155 
Leu He Ser Lys He Val Pro Pro Thr Ser Leu Asp Val Pro Leu Val 
285 290 295 

GGC AAG TAC CTC ATG TTC ACC ATG GTG CTT GTC ACC TTC TCC ATC GTC 1203 
Glv Lys Tyr Leu Met Phe Thr Met Val Leu Val Thr Phe Ser He Val 
300 305 310 

ACC AGC GTG TGC GTG CTC AAC GTG CAC CAC CGC TCG CCC ACC ACG CAC 1251 
Thr Ser Val Cys Val Leu Asn Val His His Arg Ser Pro Thr Thr Hxs 
315 320 325 

ACC ATG GCG CCC TGG GTG AAG GTC GTC TTC CTG GAG AAG CTG CCC GCG 1299 
Thr Met Ala Pro Trp Val Lys Val Val Phe Leu Glu Lys Leu Pro Ala 
330 335 340 345 

CTG CTC TTC ATG CAG CAG CCA CGC CAT CAT TGC GCC CGT CAG CGC CTG 1347 
Leu Leu Phe Met Gin Gin Pro Arg His His Cys Ala Arg Gin Arg Leu 
350 355 360 

CGC CTG CGG CGA CGC CAG CGT GAG CGC GAG GGC GCT GGA GCC CTC TTC 1395 
Arg Leu Arg Arg Arg Gin Arg Glu Arg Glu Gly Ala Gly Ala Leu Phe 
365 370 375 

TTC CGC GAA GCC CCA GGG GCC GAC TCC TGC ACG TGC TTC GTC AAC CGC 1443 
Phe Arq Glu Ala Pro Gly Ala Asp Ser Cys Thr Cys Phe Val Asn Arg 
^ 380 385 390 

GCG TCG GTG CAG GGG TTG GCC GGG GCC TTC GGG GCT GAG CCT GCA CCA 1491 
Ala ser Val Gin Gly Leu Ala Gly Ala Phe Gly Ala Glu Pro Ala Pro 
395 400 405 

GTG GCG GGC CCC GGG CGC TCA GGG GAG CCG TGT GGC TGT GGC CTC CGG 
val Ala Gly Pro Gly Arg Ser Gly Glu Pro Cys Gly Cys Gly Leu Arg 
410 415 420 425 

GAG GCG GTG GAC GGC GTG CGC TTC ATC GCA GAC CAC ATG CGG AGC GAG 



1539 



1587 
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Glu Ala Val Asp Gly Val Arg Phe lie Ala Asp His Met Arg Ser 61u 
430 435 440 

GAC GAT GAC CAG AGC GTG AGT GAG GAC TOG AAG TAC GTC GCC ATG GTG 1635 
Asp Asp Asp Gin Ser Val Ser Glu Asp Trp Lys Tyr Val Ala Met Val 
445 450 455 

ATC GAC CGC CTC TTC CTC TGG ATC TTT GTC TTT GTC TGT GTC TTT GGC 1683 
lie Asp Arg Leu Phe Leu Trp lie Phe Val Phe Val Cys Val Phe Gly 
460 465 470 

ACC ATC GGC ATG TTC CTG CAG CCT CTC TTC CAG AAC TAC ACC ACC ACC 1731 
Thr He Gly Met Phe IjCU Gin Pro Leu Phe. Gin Asn Tyr Thr Thr Thr 
475 480 485 

ACC TTC CTC CAC TCA GAC CAC TCA GCC CCC AGC TCC AAG TGA GGCCCTTCCT 1783 
Thr Phe Leu His Ser Asp His Ser Ala Pro Ser Ser Lys * 
490 495 500 

CATCTCCATG CTCTTTCACC CTGCCACCCT CTGCTGCACA GTAGTGTTGG GTGGAGGATG 1843 

GACGAGTGAG CTACCAGGAA GAGGGGCGCT GCCCCCACAG ATCCATCCTT TTGCTTCATC 1903 

TGGAGTCCCT CCTCCCCCAC GCCTCCATCC ACACACAGCA GCTCCAACCT GGAGGCTGGA 1963 

CCAACTGCTT TGTTTTGGCT GCTCTCCATC TCTTGTACCA GCCCAGGCAA TAGTGTTGAG 2023 

GAGGGGAGCA AGGCTGCTAA GTGGAAGACA GAGATGGCAG AGCCATCCAC CCTGAGGAGT 2083 

GACGGGCAA6 GGGCCAGGAA GGGGACAGGA TTGTCTGCTG CCTCCAAGTC ATGGGAGAAG 2143 

AGGGGTATAG GACAAGGGGT GGAAGGGCAG GAGCTCACAC CGCACCGGGC TGGCCTGACA 2203 

CAATGGTAGC TCTGAAGGGA GGGGAAGAGA GAGGCCTGGG TGTGACCTGA CACCTGCCGC 2263 

TGCTTGAGTG GACAGCAGCT GGACTGGGTG GGCCCCACAG TGGTCAGCGA TTCCTGCCAA 2323 

GTAGGGTTTA GCCGGGCCCC ATGGTCACAG ACCCCTGGGG GAGGCTTCCA GCTCAGTCCC 2383 

ACAGCCCCTT GCTTCTAAGG GATCCAGAGA CCTGCTCCAG ATCCTCTTTC CCCACTGAAG 2443 

AATTC 2448 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



Met 


Ala 


Arg 


Arg 


Cys 


Gly 


Pro 


Val Ala 


Leu 


Leu 


Leu Gly Phe Gly Leu 


1 




5 








10 




15 




Leu 


Arg 


Leu 


Cys 


Ser 


Gly 


Val 


Trp Gly 


Thr 


Asp 


Thr Glu Glu Arg 


Leu 






20 








25 






30 




Val 


Glu 


His 


Leu 


Leu 


Asp 


Pro 


Ser Arg 


Tyr 


Asn 


Lys Leu He Arg 


Pro 






35 








40 






45 




Ala 


Thr 


Asn 


Gly 


Ser 


Glu 


Leu 


Val Thr 


Val 


Gin 


Leu Met Val Ser 


Leu 




50 








55 








60 




Ala 


Gin 


Leu 


He 


Ser 


Val 


His 


Glu Arg 


Glu 


Gin 


He Met Thr Thr 


Asn 


65 










70 








75 




80 


Val 


Trp 


Leu 


Thr 


Gin 


Glu 


Trp 


Glu Asp 


Tyr 


Arg 


Leu Thr Trp Lys 


Pro 








85 








90 




95 




Glu 


Glu 


Phe 


Asp 


Asn 


Met 


Lys 


Lys Val 


Arg 


Leu 


Pro Ser Lys His 


He 
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100 105 110 

Trp Leu Pro Asp Val Val Leu Tyr Asn Asn Ala Asp Gly Met Tyr Glu 

115 120 125 

Val Ser Phe Tyr Ser Asn Ala Val Val Ser Tyr Asp Gly Ser lie Phe 

130 135 140 

Trp Leu Pro Pro Ala lie Tyr Lys Ser Ala Cys Lys lie Glu Val Lys 
145 150 155 160 

His Phe Pro Phe Asp Gin Gin Asn Cys Thr Met Lys Phe Arg Ser Trp 

165 170 175 

Thr Tyr Asp Arg Thr Glu lie Asp Leu Val Leu Lys Ser Glu Val Ala 

180 185 190 

Ser Leu Asp Asp Phe Thr Pro Ser Gly Glu Trp Asp lie Val Ala Leu 

195 200 205 

Pro Gly Arg Arg Asn Glu Asn Pro Asp Asp Ser Thr Tyr Val Asp He 

210 215 220 

Thr Tyr Asp Phe He He Arg Arg Lys Pro Leu Phe Tyr Thr He Asn 
225 230 235 240 

Leu He He Pro Cys Val Leu He Thr Ser Leu Ala He Leu Val Phe 

245 250 255 

Tyr Leu Pro Ser Asp Cys Gly Glu Lys Met Thr Leu Cys He Ser Val 

260 265 270 

Leu Leu Ala Leu Thr Val Phe Leu Leu Leu He Ser Lys He Val Pro 

275 280 285 

Pro Thr Ser Leu Asp Val Pro Leu Val Gly Lys Tyr Leu Met Phe Thr 

290 295 300 

Met Val Leu Val Thr Phe Ser He Val Thr Ser Val Cys Val Leu Asn 
305 310 315 320 

Val His His Arg Ser Pro Thr Thr His Thr Met Ala Pro Trp Val Lys 

325 330 335 

Val Val Phe Leu Glu Lys Leu Pro Ala Leu Leu Phe Met Gin Gin Pro 

340 345 350 

Arg His His Cys Ala Arg Gin Arg Leu Arg Leu Arg Arg Arg Gin Arg 

355 360 365 

Glu Arg Glu Gly Ala Gly Ala Leu Phe Phe Arg Glu Ala Pro Gly Ala 

370 375 380 

Asp Ser Cys Thr Cys Phe Val Asn Arg Ala Ser Val Gin Gly Leu Ala 
365 390 395 400 

Gly Ala Phe Gly Ala Glu Pro Ala Pro Val Ala Gly Pro Gly Arg Ser 

405 410 415 

Gly Glu Pro Cys Gly Cys Gly Leu Arg Glu Ala Val Asp Gly Val Arg 

420 425 430 

Phe He Ala Asp His Met Arg Ser Glu Asp Asp Asp Gin Ser Val Ser 

435 440 445 

Glu Asp Trp Lys Tyr Val Ala Met Val He Asp Arg Leu Phe Leu Trp 

450 455 460 

He Phe Val Phe Val Cys Val Phe Gly Thr He Gly Met Phe Leu Gin 
465 470 475 480 

Pro Leu Phe Gin Asn Tyr Thr Thr Thr Thr Phe Leu His Ser Asp His 

485 490 495 

Ser Ala Pro Ser Ser Lys 
500 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1925 base pairs 

(B) TYPE: nucleic acid- 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(iii) HYPOTHETIC/^: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 



(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 98... 14 74 

(D) OTHER INFORMATION: beta3 human neuronal nicotinic 

acetylcholine receptor 

(A) NAME/KEY: S'TTTR 

(B) LOCATION: 1...97 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1475... 1927 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

TCGGAACCCC TGTATTTTCT TTTCAAAACC CCCTTTTCCA GTGGAAATGC TCTGTTGTTA 60 
AAAAGGAAGA AACTGTCTTT CTGAAACTGA CATCACG ATG CTC CCA GAT TTT ATG 115 

Met Leu Pro Asp Phe Met 
1 5 

CTG GTT CTC ATC GTC CTT GGC ATC CCT TCC TCA GCC ACC ACA G6T TTC 163 
Leu Val Leu lie Val Leu Gly lie Pro Ser Ser Ala Thr Thr Gly Phe 
10 15 20 

AAC TCA ATC GCC GAA AAT GAA GAT GCC CTC CTC AGA CAT TTG TTC CAA 211 
Asn Ser lie Ala Glu Asn Glu Asp Ala Leu Leu Arg His Leu Phe Gin 
25 30 35 

GGT TAT CAG AAA TGG GTC CGC CCT GTA TTA CAT TCT AAT GAC ACC ATA 259 
Gly Tyr Gin Lys Trp Val Arg Pro Val Leu His Ser Asn Asp Thr lie 
40 45 50 

AAA GTA TAT TTT GGA TTG AAA ATA TCC CAG CTT GTA GAT GTG GAT GAA . 307 
Lys Val Tyr Phe Gly Leu Lys lie Ser Gin Leu Val Asp Val Asp Glu 
55 60 65 70 

AAG AAT CAG CTG ATG ACA ACC AAT GTG TGG CTC AAA CAG GAA TGG ACA 355 
Lys Asn Gin Leu Met Thr Thr Asn Val Trp Leu Lys Gin Glu Trp Thr 
75 80 85 

GAC CAC AAG TTA CGC TGG AAT CCT GAT GAT TAT GGT GGG ATC CAT TCC 403 
Asp His Lys Leu Arg Trp Asn Pro Asp Asp Tyr Gly Gly lie His Ser 
90 95 100 

ATT AAA GTT CCA TCA GAA TCT CTG TGG CTT CCT GAC ATA GTT CTC TTT 451 
lie Lys Val Pro Ser Glu Ser Leu Trp Leu Pro Asp lie Val Leu Phe 
105 110 115 

GAA AAT GCT GAC GGC CGC TTC GAA GGC TCC CTG ATG ACC AAG GTC ATC 499 
Glu Asn Ala Asp Gly Arg Phe Glu Gly Ser Leu Met Thr Lys Val lie 
120 125 130 



GTG AAA TCA AAC GGA ACT GTT GTC TGG ACC CCT CCC GCC AGC TAG ATA 



547 
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Val Lys Ser Asn Gly Thr Val Val Trp Thr Pro Pro Ala Ser Tyr Lys 
135 140 145 150 

AGC TCC TGC ACC ATG GAC GTC ACQ TTT TTC CCG TTC GAC CGA CAG AAC 595 
Ser Ser Cys Thr Met Asp Val Thr Phe Phe Pro Phe Asp Arg Gin Asn 
155 X60 165 

TGC TCC ATG AAG TTT GGA TCC TGG ACT TAT GAT GGC ACC ATG GTT GAC 643 
Cys Ser Met Lys Phe Gly Ser Trp Thr Tyr Asp Gly Thr Met Val Asp 
170 175 180 

CTC ATT TTG ATC AAT GAA AAT GTC GAC AGA AAA GAC TTC TTC GAT AAC 691 
Leu lie Leu He Asn Glu Asn Val Asp Arg Lys Asp Phe Phe Asp Asn 
185 190 195 

GGA GAA TGG GAA ATA CTG AAT GCA AAG GGG ATG AAG GGG AAC AGA AGG 739 
Gly Glu Trp Glu He Leu Asn Ala Lys Gly Met Lys Gly Asn Arg Aro 
200 205 210 

GAC GGC GTG TAC TCC TAT CCC TTT ATC ACG TAT TCC TTC GTC CTG AGA 787 
Asp Gly Val Tyr Ser Tyr Pro Phe He Thr Tyr Ser Phe Val Leu Arg 
215 220 225 230 

CGC CTG CCT TTA TTC TAT ACC CTC TTT CTC ATC ATC CCC TGC CTG GGG 835 
Arg Leu Pro Leu Phe Tyr Thr Leu Phe Leu He He Pro Cys Leu Gly 
235 240 245 

CTG TCT TTC CTA ACA GTT CTT GTG TTC TAT TTA CCT TCG GAT GAA GGA 883 
Leu Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro Ser Asp Glu Gly 
250 255 260 

GAA AAA CTT TCA TTA TCC ACA TCG GTC TTG GTT TCT CTG ACA GTT TTC 931 
Glu Lys Leu Ser Leu Ser Thr Ser Val Leu Val Ser Leu Thr Val Phe 
265 270 275 

CTT TTA GTG ATT GAA GAA ATC ATC CCA TCG TCT TCC AAA GTC ATT CCT 979 
Leu Leu Val He Glu Glu He He Pro Ser Ser Ser Lys Val He Pro 
280 285 290 

CTC ATT GGA GAG TAC CTG CTG TTC ATC ATG ATT TTT GTG ACC CTG TCC 1027 
Leu He Gly Glu Tyr Leu Leu Phe He Met He Phe Val Thr Leu Ser 
295 300 305 310 

ATC ATT GTT ACC GTG TTT GTC ATT AAC GTT CAC CAC AGA TCT TCT TCC 1075 
He He Val Thr Val Phe Val He Asn Val His His Arg Ser Ser Ser 
315 320 325 

ACG TAC CAC CCC ATG GCC CCC TGG GTT AAG AGG CTC TTT CTG CAG AAA 1123 
Thr Tyr His Pro Met Ala Pro Trp Val Lys Arg Leu Phe Leu Gin Lys 
330 335 340 

CTT CCA AAA TTA CTT TGC ATG AAA GAT CAT GTG GAT CGC TAC TCA TCC 1171 
Leu Pro Lys Leu Leu Cys Met Lys Asp His Val Asp Arg Tyr Ser Ser 
345 350 355 

CCA GAG AAA GAG GAG AGT CAA CCA GTA GTG AAA GGC AAA GTC CTC GAA 1219 
Pro Glu Lys Glu Glu Ser Gin Pro Val Val Lys Gly Lys Val Leu Glu 
360 365 370 

AAA AAG AAA CAG AAA CAG CTT AGT GAT GGA GAA AAA GTT CTA GTT GCT 1267 
Lys Lys Lys Gin Lys Gin Leu Ser Asp Gly Glu Lys Val Leu Val Ala 
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375 380 385 390 

TTT TTG GAA AAA GCT OCT GAT TCC ATT AGA TAG ATT TCC AQA CAT GTG 1315 
Phe Leu Glu Lys Ala Ala Asp Ser He Arg Tyr He Ser Arg His Val 
395 400 405 

AAG AAA GAA CAT TTT ATC AGC CAG GTA GTA CAA GAC TGG AAA TTT 6TA 1363 
Lys Lys Glu His Phe He Ser Gin Val Val Gin Asp Trp Lys Phe Val 
410 415 420 

GCT CAA GTT CTT GAC CGA ATC TTC CTG TGG CTC TTT CTG ATA GTG TCA 1411 
Ala Gin Val Leu Asp Arg He Phe Leu Trp Leu Phe Leu He Val Ser 
425 430 435 

GTA ACA GGC TCG GTT CTG ATT TTT ACC CCT GCT TTG AAG ATG TGG CTA 1459 
Val Thr Gly Ser Val Leu He Phe Thr Pro Ala Leu Lys Met Trp Leu 
440 445 450 

CAT AGT TAC CAT TAG GAATTTAAAA GACATAAGAC TAAATTACAC CTTAGACCTG AC 1516 

His Ser Tyr His * 

455 

ATCTGGCTAT CACACAGACA GAATCCAAAT GCAT6TGCTT GTTCTACGAA CCCCGAATGC 1576 

GTTGTCTTTG TGQAAATGGA ACATCTCCTC ATGGGAGAAA CTCTGGTAAA TGTGCTCATT 1636 

TGTGGTTGCC ATGAGAGTGA GCTGCTTTTA AAGAAAGTGG AGCCTCCTCA GACCCCTGCC 1696 

TTGGCTTTCC CAGACATTCA GGGAGGGATC ATAGGTCCAG GCTTGAGCTC ACATGTGGCC 1756 

AGAGTGCACA AAAAGCTGTT GCTACTTGGT GGAGGAACAC CTCCTAGAAG CAGCAGGCCT 1816 

CGGTGGTGGG GGAGGGGGGA TTCACCTGGA ATTAAGGAAG TCTCGGTGTC GAGCTATCTG 1876 

TGTGGGCAGA GCCTGGATCT CCCACCCTGC ACTGGCCTCC TTGGTGCCG 1925 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 



Met 


Leu 


Pro 


Asp 


Phe 


Met 


Leu 


Val 


Leu 


He Val 


Leu 


Gly He 


Pro Ser 


1 








5 










10 




15 


Ser 


Ala 


Thr 


Thr 


Gly 


Phe 


Asn 


Ser 


He 


Ala Glu 


Asn 


Glu Asp 


Ala Leu 








20 










25 






30 




Leu 


Arg 


His 


Leu 


Phe 


Gin 


Gly 


Tyr 


Gin 


Lys Trp 


Val 


Arg Pro 


Val Leu 






35 










40 








45 




His 


Ser 


Asn 


Asp 


Thr 


He 


Lys 


Val 


Tyr 


Phe Gly 


Leu 


Lys He 


Ser Gin 




50 










55 








60 






Leu 


Val 


Asp 


Val 


Asp 


Glu 


Lys 


Asn 


Gin 


Leu Met 


Thr 


Thr Asn 


Val Trp 


65 










70 








75 






80 


Leu 


Lys 


Gin 


Glu 


Trp 


Thr 


Asp 


His 


Lys 


Leu Arg 


Trp 


Asn Pro 


Asp Asp 










85 










90 






95 


Tyr 


Gly 


Gly 


He 


His 


Ser 


He 


Lys 


Val 


Pro Ser 


Glu 


Ser Leu 


Trp Leu 








100 










105 






110 




Pro 


Asp 


He 


Val 


Leu 


Phe 


Glu 


Asn 


Ala 


Asp Gly 


Arg 


Phe Glu 


Gly Ser 
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Leu Met 
130 
Pro Pro 
145 

Pro Phe 

Asp Gly 

Lys Asp 

Met Lys 
210 
Tyr Ser 
225 

He He 

Leu Pro 

Val Ser 

Ser Ser 
290 
He Phe 
305 

His His 

Arg Leu 

Val Asp 

Lys Gly 
370 
Glu Lys 
385 

Tyr He 

Gin Asp 

. Leu Phe 

Ala Leu 
450 



115 

Thr Lys 

Ala Ser 

Asp Arg 

Thr Met 
180 
Phe Phe 
195 

Gly Asn 

Phe Val 

Pro Cys 

Ser Asp 
260 
Leu Thr 
275 

Lys Val 

Val Thr 

Arg Ser 

Phe Leu 
340 
Arg Tyr 
355 

Lys Val 

Val Leu 

Ser Arg 

T2rp Lys 
420 
Leu He 
435 

Lys Met 



Val He Val 
135 

Tyr Lys Ser 

150 
Gin Asn Cys 
165 

Val Asp Leu 

Asp Asn Gly 

Arg Arg Asp 
215 

Leu Arg Arg 

230 
Leu Gly Leu 
245 

Glu Gly Glu 

Val Phe Leu 

He Pro Leu 
295 

Leu Ser He 

310 
Ser Ser Thr 
325 

Gin Lys Leu 

Ser Ser Pro 

Leu Glu Lys 
375 

Val Ala Phe 

390 
His Val Lys 
405 

Phe Val Ala 

Val Ser Val 

Trp Leu His 
455 



120 

Lys Ser 

Ser Cys 

Ser Met 

He Leu 
185 
Glu Trp 
200 

Gly Val 

Leu Pro 

Ser Phe 

Lys Leu 
265 
Leu Val 
280 

He Gly 

He Val 

Tyr His 

Pro Lys 
345 
Glu Lys 
360 

Lys Lys 

Leu Glu 

Lys Glu 

Gin Val 
425 
Thr Gly 
440 

Ser Tyr 



125 

Asn Gly Thr Val 
140 

Asp Val 



Thr Met 
155 
Lys Phe 
170 

He Asn 

Glu He 

Tyr Ser 

Leu Phe 
235 
Leu Thr 
250 

Ser Leu 

He Glu 

Glu Tyr 

Thr Val 
315 
Pro Met 
330 

Leu Leu 

Glu Glu 

Gin Lys 

Lys Ala 
395 
His Phe 
410 

Leu Asp 
Ser Val 
His 



Gly Ser 

Glu Asn 

Leu Asn 
205 
Tyr Pro 
220 

Tyr Thr 

Val Leu 

Ser Thr 

Glu He 
285 
Leu Leu 
300 

Phe Val 

Ala Pro 

Cys Met 

Ser Gin 
365 
Gin Leu 
380 

Ala Asp 

He Ser 

Arg He 

Leu He 
445 



Val Trp Thr 

Thr Phe Phe 
160 

Trp Thr Tyr 

175 
Val Asp Arg 
190 

Ala Lys Gly 

Phe He Thr 

Leu Phe Leu 
240 

Val Phe Tyr 

255 
Ser Val Leu 
270 

He Pro Ser 

Phe He Met 

He Asn Val 
320 

Trp Val Lys 

335 
Lys Asp His 
350 

Pro Val Val 

Ser Asp Gly 

Ser He Arg 
400 

Gin Val Val 

415 
Phe Leu Trp 
430 

Phe Thr Pro 



(2) INFORMATION FOR SEQ ID N0:17; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1915 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : do\ible 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 87,.. 1583 
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(D) OTHER INFORMATION: beta4 human neuronal nicotinic 

acetylcholine receptor 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...B6 
(D) OTHER INFORMATION: 



(A) NAME /KEY: 3'UTR 

(B) LOCATION: 1584... 1915 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CCGGCGCTCA CTCGACCGCG CGGCTCACGG GTGCCCTGTG ACCCCACAGC GGAGCTCGCG 60 
GCGGCTGCCA CCCGGCCCCG CCGGCC ATG AGG CGC GCG CCT TCC CTG GTC CTT 113 

Met Arg Arg Ala Pro Ser Leu Val Leu 
1 5 

TTC TTC CTG GTC GCC CTT TGC GGG CGC GGG AAC TGC CGC GTG GCC AAT 161 
Phe Phe Leu Val Ala Leu Cys Gly Arg Gly Asn Cys Arg Val Ala Asn 
10 15 20 25 

GCG GAG GAA AAG CTG ATG GAC GAC CTT CTG AAC AAA ACC CGT TAC AAT 209 
Ala Glu Glu Lys Leu Met Asp Asp Leu Leu Asn Lys Thr Arg Tyr Asn 
30 35 40 

AAC CTG ATC CGC CCA GCC ACC AGC TCC TCA CAG CTC ATC TCC ATC AAG 257 
Asn Leu lie Arg Pro Ala Thr Ser Ser Ser Gin Leu lie Ser lie Lys 
45 50 55 

CTG CAG CTC TCC CTG GCC CAG CTT ATC AGC GTG AAT GAG CGA GAG CAG 305 
Leu Gin Leu Ser Leu Ala Gin Leu lie Ser Val Asn Glu Arg Glu Gin 
60 65 70 

ATC ATG ACC ACC AAT GTC TGG CTG AAA CAG GAA TGG ACT GAT TAC CGC 353 
lie Met Thr Thr Asn Val Trp Leu Lys Gin Glu Trp Thr Asp Tyr Arg 
75 80 85 

CTG ACC TGG AAC AGC TCC CGC TAC GAG GGT GTG AAC ATC CTG AGG ATC 401 
Leu Thr Trp Asn Ser Ser Arg Tyr Glu Gly Val Asn lie Leu Arg lie 
90 95 100 105 

CCT GCA MlG CGC ATC TGG TTG CCT GAC ATC GTG CTT TAC AAC AAC GCC 449 
Pro Ala Lys Arg lie Trp Leu Pro Asp lie Val Leu Tyr Asn Asn Ala 
110 115 120 

GAC GGG ACC TAT GAG GTG TCT GTC TAC ACC AAC TTG ATA GTC CGG TCC 497 
Asp Gly Thr Tyr Glu Val Ser Val Tyr Thr Asn Leu lie Val Arg Ser 
125 130 135 

AAC GGC AGC GTC CTG TGG CTG CCC CCT GCC ATC TAC AAG AGC GCC TGC 545 
Asn Gly Ser Val Leu Trp Leu Pro Pro Ala lie Tyr Lys Ser Ala Cys 
140 145 150 

AAG ATT GAG GTG AAG TAC TTT CCC TTC GAC CAG CAG AAC TGC ACC CTC 593 
Lys He Glu Val Lys Tyr Phe Pro Phe Asp Gin Gin Asn Cys Thr Leu 
155 160 165 



AAG TTC- CGC TCC TGG ACC TAT GAC CAC ACG GAG ATA GAC ATG GTC CTC 



641 
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Lys Phe Arg Ser Trp Thr Tyr Asp His Thr Glu He Asp Met Val Leu 
170 175 180 185 

ATG ACG CCC ACA GCC AGC ATG GAT GAC TTT ACT CCC AGT GGT GAG TGG 689 
Met Thr Pro Thr Ala Ser Met Asp Asp Phe Thr Pro Ser Gly Glu Trp 
190 195 200 

GAC ATA GTG GCC CTC CCA GGG AGA AGG ACA GTG AAC CCA CAA GAC CCC 737 
Asp He Val Ala Leu Pro Gly Arg Arg Thr Val Asn Pro Gin Asp Pro 
205 210 215 

AGC TAC GTG GAC GTG ACT TAC GAC TTC ATC ATC AAG CGC AAG CCT CTG 785 
Ser Tyr Val Asp Val Thr Tyr Asp Phe He He Lys Arg Lys Pro Leu 
220 225 230 

TTC TAC ACC ATC AAC CTC ATC ATC CCC TGC GTG CTC ACC ACC TTG CTG 833 
Phe Tyr Thr He Asn Leu He He Pro Cys Val Leu Thr Thr Leu Leu 
235 240 245 

GCC ATC CTC GTC TTC TAC CTG CCA TCC GAC TGC GGC GAG AAG ATG ACA 881 
Ala He Leu Val Phe Tyr Leu Pro Ser Asp Cys Gly Glu Lys Met Thr 
250 255 260 265 

CTG TGC ATC TCA GTG CTG CTG GCA CTG ACA TTC TTC CTG CTG CTC ATC 929 
Leu Cys He Ser Val Leu Leu Ala l,eu Thr Phe Phe Leu Leu Leu He 
270 275 280 

TCC AAG ATC GTG CCA CCC ACC TCC CTC GAT GTG CCT CTC ATC GGC AAG 977 
Ser Lys He Val Pro Pro Thr Ser Leu Asp Val Pro Leu He Gly Lys 
265 290 295 

TAC CTC ATG TTC ACC ATG GTG CTG GTC ACC TTC TCC ATC GTC ACC AGC 1025 
Tyr Leu Met Phe Thr Met Val Leu Val Thr Phe Ser He Val Thr Ser 
300 305 310 

GTC TGT GTG CTC AAT GTG CAC CAC CGC TCG CCC AGC ACC CAC ACC ATG 1073 
Val Cys Val Leu Asn Val His His Arg Ser Pro Ser Thr His Thr Met 
315 320 325 

GCA CCC TGG GTC AAG CGC TGC TTC CTG CAC AAG CTG CCT ACC TTC CTC 1121 
Ala Pro Trp Val Lys Arg Cys Phe Leu His Lys Leu Pro Thr Phe Leu 
330 335 340 345 

TTC ATG AAG CGC CCT GGC CCC GAC AGC AGC CCG GCC AGA GCC TTC CCG 1169 
Phe Met Lys Arg Pro Gly Pro Asp Ser Ser Pro Ala Arg Ala Phe Pro 
350 355 360 

CCC AGC AAG TCA TGC GTG ACC AAG CCC GAG GCC ACC GCC ACC TCC ACC 1217 
Pro Ser Lys . Ser Cys Val Thr Lys Pro Glu Ala Thr Ala Thr Ser Thr 
365 370 375 

AGC CCC TCC AAC TTC TAT GGG AAC TCC ATG TAC TTT GTG AAC CCC GCC 1265 
Ser Pro Ser Asn Phe Tyr Gly Asn Ser Met Tyr Phe Val Asn Pro Ala 
380 385 390 

TCT GCA GCT TCC AAG TCT CCA GCC GGC TCT ACC CCG GTG GCT ATC CCC 1313 
Ser Ala Ala Ser Lys Ser Pro Ala Gly Ser Thr Pro Val Ala He Pro 
395 400 405 

AGG GAT TTC TGG CTG CGG TCC TCT GGG AGG TTC CGA CAG GAT GTG CAG 1361 
Arg Asp Phe Trp Leu Arg Ser Ser Gly Arg Phe Arg Gin Asp Val Gin 
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425 

ATG AAG AAT GAC 1409 
Met Lys Asn Asp 
440 

GTG GCT ATG GTG 1457 
Val Ala Met Val 
455 

TTC ATG TTT GTG TGC GTC CTG GGC 1505 
Phe Met Phe Val Cys Val Leu Gly 
470 



410 

GAG GCA TTA 
Glu Ala Leu 



GAT GAA GAC 
Asp Glu Asp 



GTG GAC CGG 
Val Asp Arg 
460 

ACT GTG 
Thr Val 
475 

GAG GGG CCC 
Glu Gly Pro 
490 



415 

GAA 6GT GTC 
Glu Gly Val 
430 



CTG TTC CTG 
Leu Phe Leu 



TAC GCT GCC 
Tyr Ala Ala 
495 



AGC TTC ATC 
Ser Phe lie 



TGG GTG 
Trp Val 
465 

CCG CCC CTC 
Pro Pro Leu 
480 

CAG CGT GAC 
Gin Arg Asp 



420 

GCC CAG CAC 
Ala Gin His 
435 

TGG AAG TAC 
Trp Lys Tyr 



TTC CAG ACC 
Phe Gin Thr 
485 



CAT GCA GCT 
His Ala Ala 



TCT 1553 
Ser 



TGA GGGCCCCCTG GGTTGTGGGG TGAG 1607 
* 



GGG CTC TTC CTA 
Gly Leu Phe Leu 



CAG AGT GTC GTT GAG GAC 
Gin Ser Val Val Glu Asp 
445 450 



AGGATGTGAG TGGCCGGGTG GGCACTTTGC TGCTTCTTTC TGGGTTGTGG CCGATGAGGC 1667 

CCTAAGTAAA TATGTGAGCA TTGGCCATCA ACCCCATCAA ACCAGCCACA GCCGTGGAAC 1727 

AGGCAAGGAT GGGGGCCTGG GCTGTCCTCT CTGAATGCCT TGGAGGGATC CCAGGAAGCC 1787 

CCAGTAGGAG GGAGCTTCAG ACAGTTCAAT TCTGGCCTGT CTTCCTTCCC TGCACCGGGC 1847 

AATGGGGATA AAGATGACTT CGTAGCAGCA CCTACTATGC TTCAGGCATG GTGCCGGCCT 1907 

GCCTCTCC 1915 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQXJENCE CHARACTERISTICS: 

(A) LENGTH: 499 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE: i 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Met Arg Arg Ala Pro Ser Leu Val Leu Phe Phe Leu Val Ala Leu Cys 

1 5 10 15 

Gly Arg Gly Asn Cys Arg Val Ala Asn Ala Glu Glu Lys Leu Met Asp 

20 25 30 

Asp Leu Leu Asn Lys Thr Arg Tyr Asn Asn Leu lie Arg Pro Ala Thr 

35 40 45 

Ser Ser Ser Gin Leu lie Ser He Lys Leu Gin Leu Ser Leu Ala Gin 

50 55 60 

Leu He Ser Val Asn Glu Arg Glu Gin He Met Thr Thr Asn Val Trp 
65 70 75 80 

Leu Lys Gin Glu Trp Thr Asp Tyr Arg Leu Thr Trp Asn Ser Ser Arg 

85 90 95 

Tyr Glu Gly Val Asn He Leu Arg He Pro Ala Lys Arg He Trp Leu 

100 105 110 

Pro Asp He Val Leu Tyr Asn Asn Ala Asp Gly Thr Tyr Glu Val Ser 
115 120 125 
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Val Tyr Thr 

130 
Pro Pro Ala 
145 

Pro Phe Asp 

Asp His Thr 

Asp Asp Phe 
195 

Arg Arg Thr 

210 
Asp Phe lie 
225 

lie Pro Cys 

Pro Ser Asp 

Ala Leu Thr 
275 

Ser Leu Asp 

290 
Leu Val Thr 
305 

His Arg Ser 

Phe Leu His 

Asp Ser Ser 
355 

Lys Pro Glu 

370 
Asn Ser Met 
385 

Ala Gly Ser 

Ser Gly Arg 

Phe lie Ala 
435 

Glu Asp Trp 

450 
Val Phe Met 
465 

Pro Leu Phe 
Arg Asp 



Asn Leu 

He Tyr 

Gin Gin 
165 
Glu He 
180 

Thr Pro 

Val Asn 

He Lys 

val Leu 
245 
Cys Gly 
260 

Phe Phe 

Val Pro 

Phe Ser 

Pro Ser 
325 
Lys Leu 
340 

Pro Ala 

Ala Thr 

Tyr Phe 

Thr Pro 
405 
Phe Arg 
420 

Gin His 

Lys Tyr 

Phe Val 

Gin Thr 
485 



He Val 
135 
Lys Ser 
150 

Asn Cys 

Asp Met 

Ser Gly 

Pro Gin 
215 
Arg Lys 
230 

Thr Thr 

Glu Lys 

Leu Leu 

lieu He 
295 
He Val 
310 

Thr His 

Pro Thr 

Arg Ala 

Ala Thr 
375 
Val Asn 
390 

Val Ala 

Gin Asp 

Met Lys 

Val Ala 
455 
Cys Val 
470 

His Ala 



Arg Ser 
Ala Cys 
Thr Leu 



Asn Gly Ser 
140 

Lys He Glu 

155 
Lys Phe Arg 
170 

Met Thr Pro 



Val Leu 
185 

Glu Trp Asp He Val 
200 

Asp Pro 



Pro Leu 

Leu Leu 

Met Thr 
265 
Leu He 
280 

Gly Lys 

Thr Ser 

Thr Met 

Phe Leu 
345 
Phe Pro 
360 

Ser Thr 
Pro Ala 
He Pro 



Ser Tyr Val 
220 

Phe Tyr Thr 

235 
Ala He Leu 
250 

Leu Cys He 

Ser Lys He 

Tyr Leu Met 
300 

Val Cys Val 

315 
Ala Pro Trp 
330 

Phe Met Lys 

Pro Ser Lys 

Ser Pro Ser 
380 

Ser Ala Ala 

395 
Arg Asp Phe 
410 

Glu Ala Leu 



Val Gin 
425 

Asn Asp Asp Glu Asp 
440 

Met Val 



Leu Gly 
Ala Ser 



Val Asp Arg 
460 

Thr Val Gly 

475 
Glu Gly Pro 
490 



Val Leu Trp Leu 

Val Lys Tyr Phe 
160 

Ser Trp Thr Tyr 
175 

Thr Ala Ser Met 
190 

Ala Leu Pro Gly 
205 

Asp Val Thr Tyr 

He Asn Leu He 
240 

Val Phe Tyr Leu 

255 

Ser Val Leu Leu 
270 

Val Pro Pro Thr 
285 

Phe Thr Met Val 

Leu Asn Val His 
320 

Val Lys Arg Cys 
335 

Arg Pro Gly Pro 
350 

Ser Cys Val Thr 
365 

Asn Phe Tyr Gly 

Ser Lys Ser Pro 
400 

Trp Leu Arg Ser 
415 

Glu Gly Val Ser 
430 

Gin Ser Val Val 
445 

Leu Phe Leu Trp 

Leu Phe Leu Pro 
480 

Tyr Ala Ala Gin 
495 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: 
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(vi) ORIGINAL SOURCE: 
(ix) FEATURE: 

(A) NAME/KEY: Coding Sequence 

(B) LOCATION: 143... 1582 

(D) OTHER INFORMATION: alpha6 (del 74-88) subunit 

human neuronal nicotinic acetylcholine rec. 

(A) NAME/KEY: 5'UTR 

(B) LOCATION: 1...143 
(D) OTHER INFORMATION: 



(A) NAME/KEY: 3'UTR 

(B) LOCATION: 1583... 1698 
(D) OTHER INFORMATION: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGGGTTTTGA TTTCTGAGAA GACACACACG GATTGCAGTG GGCTTCTGAT GATGTCAAGG 60 
TTGGATGCAT GTGGCTGACT 6ATAGCTCTT TGTTTTCCAC AATCCTTTGC CTAGGAAAAA 120 
GGAATCCAAG TGTGTTTTAA CC ATG CTG ACC AGC AAG GGG CAG GGA TTC CTT 172 

Met Leu Thr Ser Lys Gly Gin Gly Phe Leu 
15 10 

CAT GGG GGC TTG TGT CTC TGG CTG TGT GTG TTC ACA CCT TTC TTT AAA 220 
His Gly Gly Leu Cys Leu Trp Leu Cys Val Phe Thr Pro Phe Phe Lys 
15 20 25 

GGC TGT GTG GGC TGT GCA ACT GAG GAG AGG CTC TTC CAC AAA CTG TTT 268 
Gly Cys Val Gly Cys Ala Thr Glu Glu Arg Leu Phe His Lys Leu Phe 
30 35 40 

TCT CAT TAC AAC CAG TTC ATC AGG CCT GTG GAA AAC GTT TCC GAC CCT 316 
Ser His Tyr Asn Gin Phe lie Arg Pro Val Glu Asn Val Ser Asp Pro 
45 50 55 

GTC ACG GTA CAC TTT GAA GTG GCC ATC ACC CAG CTG GCC AAC GTG ATC 364 
Val Thr Val His Phe Glu Val Ala lie Thr Gin Leu Ala Asn Val lie 
60 65 70 

TGG AAT GAT TAT AAA TTG CGC TGG GAT CCA ATG GAA TAT GAT GGC ATT 412 
Trp Asn Asp Tyr Lys Leu Arg Trp Asp Pro Met Glu Tyr Asp Gly lie 
75 80 85 90 

GAG ACT CTT CGC GTT CCT GCA GAT AAG ATT TGG AAG CCC GAC ATT GTT 460 
Glu Thr Leu Arg Val Pro Ala Asp Lys He Trp Lys Pro Asp He Val 
95 100 105 

CTC TAT AAC AAT GCT GTT GGT GAC TTC CAA GTA GAA GGC AAA ACA AAA 508 
Leu Tyr Asn Asn Ala Val Gly Asp Phe Gin Val Glu Gly Lys Thr Lys 
110 115 120 

GCT CTT CTT AAA TAC AAT GGC ATG ATA ACC TGG ACT CCA CCA GCT ATT 556 
Ala Leu Leu Lys Tyr Asn Gly Met He Thr Trp Thr Pro Pro Ala He 
125 130 135 

TTT AAG AGT TCC TGC CCT ATG GAT ATC ACC TT ] TTC CCT TTT GAT CAT 604 
Phe Lys Ser Ser Cys Pro Met Asp He Thr Phti Phe Pro Phe Asp His 
140 145 150 
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CAA AAC TGT TCC CTA AAA TTT GGT TCC TGG ACG TAT GAC AAA GOT GAA 652 
Gin Asn Cys Ser Leu Lys Phe Gly Ser Trp Thr Tyr Asp Lys Ala Glu 
155 160 165 170 

ATT GAT CTT CTA ATC ATT GGA TCA AAA GTG GAT ATG AAT GAT TTT TGG 700 
He Asp Leu Leu He He Gly Ser Lys Val Asp Met Asn Asp Phe Trp 
175 180 185 

GAA AAC AGT GAA TGG GfiA ATC ATT GAT GCC TCT GGC TAG AAA CAT GAC 748 
Glu Asn Ser Glu Trp Glu He He Asp Ala Ser Gly Tyr Lys His Asp 
190 195 200 

ATC AAA TAC AAC TGT TGT GAA GAG ATA TAG ACA GAT ATA ACC TAT TCT 796 
He Lys Tyr Asn Cys Cys Glu Glu He Tyr Thr Asp He Thr Tyr Ser 
205 210 215 

TTC TAC ATT AGA AGA TTG CCG ATG TTT TAC ACG ATT AAT CTG ATC ATC 844 
Phe Tyr He Arg Arg Leu Pro Met Phe Tyr Thr He Asn Leu He He 
220 225 230 

CCT TGT CTC TTT ATT TCA TTT CTA ACC GTG TTG GTC TTT TAC CTT CCT 892 
Pro Cys Leu Phe He Ser Phe Leu Thr Val Leu Val Phe Tyr Leu Pro 
235 240 245 250 

TCQ GAC TGT GGT GAA AAA GTG ACG CTT TGT ATT TCA GTC CTG CTT TCT 940 
Ser Asp Cys Gly Glu Lys Val Thr Leu Cys He Ser Val Leu Leu Ser 
255 260 265 

CTG ACT GTG TTT TTQ CTG GTC ATC ACA GAA ACC ATC CCA TCC ACA TCT 988 
Leu Thr Val Phe Leu Leu Val He Thr Glu Thr He Pro Ser Thr Ser 
270 275 280 

CTG GTG GTC CCA CTG GTG GGT GAG TAC CTG CTG TTC ACC ATG ATC TTT 1036 
Leu Val Val Pro Leu Val Gly Glu Tyr Leu Leu Phe Thr Met He Phe 
285 290 295 

GTC ACA CTG TCC ATC GTG GTG ACT GTG TTT GTG TTG AAC ATA CAC TAC 1084 
Val Thr Leu Ser He Val Val Thr Val Phe Val Leu Asn He His Tyr 
300 305 310 

CGC ACC CCA ACC ACG CAC ACA ATG CCC AGG TGG GTG AAG ACA GTT TTC 1132 
Arg Thr Pro Thr Thr His Thr Met Pro Arg Trp Val Lys Thr Val i Phe 
315 320 325 330 

CTG AAG CTG CTG CCC CAG GTC CTG CTG ATG AGG TGG CCT CTG GAC AAG 1180 
Leu Lys Leu Leu Pro Gin Val Leu Leu Met Arg Trp Pro Leu Asp Lys 
335 340 345 

ACA AGG GGC ACA GGC TCT GAT GCA GTG CCC AGA GGC CTT GCC AGG AGG 1228 
Thr Arg Gly Thr Gly Ser Asp Ala Val Pro Arg Gly Leu Ala Arg Arg 
350 355 360 

CCT GCC AAA GGC AAG CTT GCA AGC CAT GGG GAA CCC AGA CAT CTT AAA 1276 
Pro Ala Lys Gly Lys Leu Ala Ser His Gly Glu Pro Arg His Leu Lys 
365 370 375 

GAA TGC TTC CAT TGT CAC AAA TCA AAT GAG CTT GCC ACA AGC AAG AGA 1324 
Glu Cys Phe His Cys His Lys Ser Asn Glu Leu Ala Thr Ser Lys Arq 
380 385 390 

AGA TTA AGT CAT CAG CCA TTA CAG TGG GTG GTG GAA AAT TCG GAG CAC 1372 
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Arg Leu Ser His Gin Pro Leu Gin Trp Val Val Glu Asn Ser Glu His 
395 400 405 410 

TCG CCT 6AA GTT GAA GAT GTG ATT AAC AGT GTT CAG TTC ATA GCA GAA 1420 
Ser Pro Glu Val Glu Asp Val lie Asn Ser Val Gin Phe He Ala Glu 
415 420 425 

AAC ATG AAG AGC CAC AAT GAA ACC AAG GAG GTA GAA GAT GAC TGG AAA 1468 
Asn Met Lys Ser His Asn Glu Thr Lys Glu Val Glu Asp Asp Trp Lys 
430 435 440 

TAG GTG GCC ATG GTG GTG GAC AGA GTA TTT CTT TGG GTA TTT ATA ATT 1516 
Tyr Val Ala Met Val Val Asp Arg Val Phe Leu Trp Val Phe He He 
445 450 455 

GTC TGT GTA TTT GGA ACT GCA GGG CTA TTT CTA CAG CCA CTA CTT GGG 1564 
Val Cys Val Phe Gly Thr Ala Gly Leu Phe Leu Gin Pro Leu Leu Gly 
460 465 470 

AAC ACA GGA AAA TCT TAA 7ATGTATTTT CTTTTATGTT CAGAAATTTA CAQACACCA 1621 
Asn Thr Gly Lys Ser * 
475 460 

TATTTGTTCT GCATTCCCTG CCACAAGGAA AGGAAA6CAA AGGCTTCCCA CCCAAGTCCC 1681 
CCATCTGCTA AAACCCG 1698 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(v) FRAGMENT TYPE: internal 

(vi) ORIGINAL SOURCE: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



Met 


Leu 


Thr 


Ser 


Lys 


Gly 


Gin 


Gly 


Phe 


Leu 


His 


Gly Gly Leu Cys Leu 


1 








5 










10 




15 


Trp 


Leu 


Cys 


Val 


Phe 


Thr 


Pro 


Phe 


Phe 


Lys 


Gly 


Cys Val Gly Cys Ala 








20 










25 






30 


Thr 


Glu 


Glu 


Arg 


Leu 


Phe 


His 


Lys 


Leu 


Phe 


Ser 


His Tyr Asn Gin Phe 






35 










40 








45 


He 


Arg 


Pro 


Val 


Glu 


Asn 


Val 


Ser 


Asp 


Pro 


Val 


Thr Val His Phe Glu 




50 










55 








60 


Val 


Ala 


He 


Thr 


Gin 


Leu 


Ala 


Asn 


Val 


He 


Trp 


Asn Asp Tyr Lys Leu 


65 










70 










75 


80 


Arg 


Trp 


Asp 


Pro 


Met 


Glu 


Tyr 


Asp 


Gly 


He 


Glu 


Thr Leu Arg Val Pro 










85 










90 




95 


Ala 


Asp 


Lys 


He 


Trp 


Lys 


Pro 


Asp 


He 


Val 


Leu 


Tyr Asn Asn Ala Val 








100 










105 






110 


Gly 


Asp 


Phe 


Gin 


Val 


Glu 


Gly 


Lys 


Thr 


Lys 


Ala 


Leu Leu Lys Tyr Asn 






115 










120 








125 


Gly 


Met 


He 


Thr 


Trp 


Thr 


Pro 


Pro 


Ala 


He 


Ph 1 


Lys Ser Ser Cys Pro 




130 










135 










140 


Met 


Asp 


He 


Thr 


Phe 


Phe 


Pro 


Phe 


Asp 


His 


Gin 


Asn Cys Ser Leu Lys 
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1 






1 tzn 










ICC 

X55 


160 










ASp 


Lys 


2il a 

Axa 


uXU 


xxe 


Asp Leu Leu He He 
















X70 




175 






T.'xra 


Va1 Zlart Ma^ 

VAX naff CUSUt 


Asn 


ASp 


Jriie 


Trp 


/^l 4« 

GXU 


Asn Ser Glu Trp Glu 














xes 






190 








Ma fier Glv 

aXA OCX VXjr 


lyr 


ijys 


nxs 


Asp 


Tl A 

xxe 


Lys Tyr Asn Cys Cys 












"7 nn 

2UU 








205 


V7JLU 


UJLU 




xyx xxix Asp 


xxe 


lllx 


Tyr 


oer 


Fne 


Tyr He Arg Arg Leu 




















220 




Moi- 


IT lie 


±jrX XUX XX6 


noil 


ijeu 


Tl A 
XX6 


Tl A 

xxe 


Fro 


Cys Leu Phe He Ser 


225 






•5 "a A 
2 JU 










235 


240 


iriie 


I16U 


rpVii* 

inr 


Taii Val 
VaX JUeU VaX 




Tyr 


Leu 


irro 


Ser 


Asp Cys Gly Glu Lys 








Oil c 

245 


vax 






250 




255 




Tiur 


jjeu 


Cys He Ser 


Leu 


Leu 


Ser 


Leu 


Thr Val Phe Leu Xieu 








260 






265 






270 


vajL 


lie 


TJir 


Gxu TXir xxe 


Pro 


ser 


Tnr 


Ser 


Leu 


Val Val Pro Leu Val 






275 






280 








265 


Qiy 


Glu 


Tyr 


Leu Leu Phe 


Thr 


Met 


He 


Phe 


Val 


Thr Leu Ser He Val 




290 






295 










300 


Vai 


Thr 


Val 


Phe Val Leu 


Asn 


He 


His 


Tyr 


Arg 


Thr Pro Thr Thr His 


305 






310 








315 


320 


Thr 


Met 


Pro 


Arg Trp Val 


Lys 


Thr 


Val 


Phe 


Leu 


Lys Leu Leu Pro Gin 








325 








330 




335 ' 


Val 


Leu 


Leu 


Met Arg Trp 


Pro 


Leu 


Asp 


Lys 


Thr 


Arg Gly Thr Gly Ser 








340 






345 






350 


Asp 


Ala 


Val 


Pro Arg Gly 


Leu 


Ala 


Arg 


Arg 


Pro 


Ala Lys Gly Lys Leu 






355 






360 








365 


Ala 


Ser 


His 


Gly Glu Pro 


Arg 


His 


Leu 


Lys 


Glu 


Cys Phe His Cys His 




370 






375 










380 


Lys 


Ser 


Asn 


Glu Leu Ala 


Thr 


Ser 


Lys 


Arg 


Arg 


Leu Ser His Gin Pro 


385 






390 










395 


400 


Leu 


Gin 


Trp 


Val Val Glu 


Asn 


Ser 


Glu 


His 


Ser 


Pro Glu Val Glu Asp 








405 








410 




415 


Val 


He 


Asn 


Ser Val Gin 


Phe 


He 


Ala 


Glu 


Asn 


Met Lys Ser His Asn 








420 






425 






430 


Glu 


Thr 


Lys 


Glu Val Glu 


Asp 


Asp 


Trp 


Lys 


Tyr 


Val Ala Met Val Val 






435 






440 








445 


Asp 


Arg 


Val 


Phe Leu Trp 


Val 


Phe 


He 


He 


Val 


Cys Val Phe Gly Thr 




450 






455 










460 


Ala 


Gly 


Leu 


Phe Leu Gin 


Pro 


Leu 


Leu 


Gly 


Asn 


Thr Gly Lys Ser 


465 






470 










475 
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Summary of Sequences 

Sequence ID No. 1 is a nucleotide sequence encoding an subunit 
of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 
5 Sequence ID No. 2 is the amino acid sequence of the 02 subunit of 

a human neuronal nicotinic acetylcholine receptor set forth in Sequence 
ID No. 1. 

Sequence ID No. 3 is a nucleotide sequence encoding a subunit 
of a human neuronal nicotinic acetylcholine receptor, and the deduced 
10 amino acid sequence thereof. 

Sequence ID No. 4 is the amino acid sequence of the O3 subunit of 
a human neuronal nicotinic acetylcholine receptor set forth in Sequence 
ID No. 3. 

Sequence ID No. 5 is a nucleotide sequence encoding an 04 subunit 
15 of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 

Sequence ID No. 6 is the amino acid sequence of the 04 subunit of 
a human neuronal nicotinic acetylcholine receptor set forth in Sequence 
ID No. 5. 

20 Sequence ID No. 7 is a nucleotide sequence encoding an subunit 

of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 

Sequence ID No. 8 is the amino acid sequence of the subunit of 
a human neuronal nicotinic acetylcholine receptor set forth in Sequence 

25 ID No. 7. 

Sequence ID No. 9 is a nucleotide sequence encoding an subunit 
of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 
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Sequence ID No. 10 is the amino acid sequence of the subunit 
of a human neuronal nicotinic acetylcholine receptor set forth in 
Sequence ID No. 9. 

Sequence ID No. 1 1 is a nucleotide sequence encoding an Oy 
5 subunit of a human neuronal nicotinic acetylcholine receptor, and the 
deduced amino acid sequence thereof. 

Sequence ID No. 12 is the amino acid sequence of the a-, subunit 
of a human neuronal nicotinic acetylcholine receptor set forth in 
Sequence ID No. 11. 
10 Sequence ID No. 13 is a nucleotide sequence encoding a 02 subunit 

of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 

Sequence ID No. 14 is the amino acid sequence of the 02 subunit 
of a human neuronal nicotinic acetylcholine receptor set forth in 
15 Sequence ID No. 13. 

Sequence ID No. 1 5 is a nucleotide sequence encoding a fi^ subunit 
of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 

Sequence ID No. 1 6 is the amino acid sequence of the 0^ subunit 
20 of a human neuronal nicotinic acetylcholine receptor set forth in 
Sequence ID No. 15. 

Sequence ID No. 17 is a nucleotide sequence encoding a 0^ subunit 
of a human neuronal nicotinic acetylcholine receptor, and the deduced 
amino acid sequence thereof. 
25 Sequence ID No. 18 is the amino acid sequence of the 0^ subunit 

of a human neuronal nicotinic acetylcholine receptor set forth in 
Sequence ID No. 17. 



I 

I 
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Sequence ID No. 1 9 is a nucleotide sequence encoding a variant 
subunit of a human neuronal nicotinic acetylcholine receptor, and the 
deduced amino acid sequence thereof. 

Sequence ID No. 20 is the amino acid sequence of the Oe subunit 
5 of a human neuronal nicotinic acetylcholine receptor set forth in 
Sequence ID No. 1 9. 
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THAT WHICH IS CLAIMED: 

1 . An isolated nucleic acid molecule, comprising a sequence of 
nucleotides encoding an subunit of a human neuronal nicotinic 
acetylcholine receptor. 
5 2. The molecule of claim 1 , wherein the subunit comprises 

the sequence of amino acids set forth in SEQ ID NO: 10 or functional 
equivalents thereof. 

3. The molecule of claim 1 , wherein the subunit comprises 
the sequence of amino acids set forth In SEQ ID NO: 10 
10 4. The molecule of claim 1 , wherein the subunit comprises 

the sequence of amino acids set forth in SEQ ID N0:20 or functional 
equivalents thereof. 

5. The molecule of claim 1 , wherein the subunit comprises 
the sequence of amino acids set forth in SEQ ID N0:20. 
15 6. The molecule of claim 1, wherein the sequence of 

nucleotides hybridizes to nucleotides 143-1624 set forth in SEQ ID N0:9 
under high stringency conditions, or 

the sequence of nucleotides hybridizes under high stringency 
conditions to nucleotides 143-1579 set forth in SEQ ID NO: 19. 
20 7. The molecule of claim 1, comprising nucleotides 143-1624 

set forth in SEQ ID N0:9 or functional equivalents thereof. 

8. The molecule of claim 1, comprising nucleotides 143-1624 
set forth in SEQ ID N0:9. 

9. The molecule of claim 1, comprising nucleotides 143-1579 
25 set forth in SEQ ID NO: 19 or functional equivalent thereof. 

10. The molecule of claim 1, comprising nucleotides 143-1579 
set forth in SEQ ID NO: 19. 
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11. An isolated nucleic acid molecule, comprising a sequence of 
nucleotides encoding a subunit of a human neuronal nicotinic 
acetylcholine receptor. 

1 2. The molecule of claim 1 1 , wherein the subunit comprises 
5 the sequence of amino acids set forth in SEQ ID NO: 16 or functional 

equivalents thereof. 

13. The molecule of claim 11, wherein the fi^ subunit comprises 
the sequence of amino acids set forth in SEQ ID NO: 16. 

14. The molecule of claim 11, comprising a sequence of 
10 nucleotides that hybridizes under high stringency conditions to 

nucleotides 98-1471 set forth in SEQ ID N0:15. 

15. The molecule of claim 11, comprising nucleotides 98-1471 
set forth in SEQ ID NO: 15 or functional equivalents thereof. 

16. The molecule of claim 11, comprising nucleotides 98-1471 
1 5 set forth in SEQ ID NO: 1 5, 

17. A single-stranded nucleic acid of at least 27 bases in length, 
comprising any 27 contiguous bases set forth in SEQ ID N0:9 or SEQ 

ID NO: 1 9 or the complement thereof. 

18. A single-stranded nucleic acid of at least 28 bases in length, 
20 comprising any 28 contiguous bases set forth in the first 105 nucleotides 

translated sequence set forth in SEQ ID NO: 15 or the complement 
thereof. 

19. The nucleic acid of claim 17 or claim 18 that is labeled. 

20. The nucleic acid of claim 19 that is labeled with ^^P. 
25 21 . A method for isolating DNA encoding a human nicotinic 

acetylcholine receptor subunit, comprising screening a library with the 
nucleic acid of claim 1 9, and isolating clones that hybridize under 
conditions of at least low stringency to the nucleic acid of claim 19. 
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22. The method of claim 21, wherein the isolated clones hybridize 
under conditions of high stringency. 

23. The method of claim 21 or claim 22, further comprising 
identifying those clones that encode an or /S^ subunit. 

5 24. Cells, comprising a nucleic acid molecule of claim 1 , wherein 

the cells are prokaryotic cells or eukaryotic cells and the nucleic acid is 
heterologous to the cells. 

25. The cells of claim 24 that are mammalian cells or amphibian 
oocytes. 

10 26. The cells of claim 24, further comprising heterologous 

nucleic acid encoding a fi subunit of human neuronal nicotinic 
acetylcholine receptor. 

27. The cells of claim 26, wherein the 0 subunit is selected from 
02' or 0^. 

1 5 28. The cells of claim 26, wherein the p subunit is jffg. 

29. The cells of claim 24, wherein the cells express functional 
neuronal nicotinic acetylcholine receptors that contain one or more 
subunits encoded by the heterologous nucleic acid. 

30. Cells, comprising a nucleic acid molecule of claim 1 1 , 

20 wherein the cells are prokaryotic cells or eukaryotic cells, and the nucleic 
acid molecule is heterologous to the cells. 

31 . The cells of claim 30 that are mammalian cells or amphibian 
oocytes. 

32. The cells of claim 31, further comprising heterologous 
25 nucleic acid encoding an a subunit of a human neuronal nicotinic 

acetylcholine receptor. 

33. The cells of claim 32, wherein the a subunit is selected from 
Oz, o^, ag, ag or Oy. 
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34. The cells of claim 30 that express functional neuronal 
nicotinic acetylcholine receptors that contain one or more subunits 
encoded by the heterologous nucleic acid. 

35. The cells of claim 31 that express functional neuronal 
5 nicotinic acetylcholine receptors that contain one or more subunits 

encoded by the heterologous nucleic acid. 

36. The molecule of claim 1 or claim 1 1 that is DNA. 

37. The molecule of claim 1 or claim 1 1 that is RNA. 

38. A method of screening compounds to identify compounds 
10 that modulate the activity of human neuronal nicotinic acetylcholine 

receptors, the method comprising determining the effect of a test 
compound on the neuronal nicotinic acetylcholine receptor activity in cells 
of claim 24 or claim 30 compared to the effect on control cells or to the 
neuronal nicotinic acetylcholine receptor activity of the cells in the 
1 5 absence of the compound. 

39. A substantially pure human neuronal nicotinic acetylcholine 
receptor subunit. 

40. A substantially pure recombinant human neuronal nicotinic 
acetylcholine receptor, comprising an human neuronal nicotinic 

20 acetylcholine receptor subunit. 

41 . The nicotinic acetylcholine receptor of claim 40, further 
comprising a human neuronal nicotinic acetylcholine receptor P subunit. 

42. A substantially pure human neuronal nicotinic acetylcholine 
receptor 0^ subunit. 

25 43. A substantially pure recombinant human neuronal nicotinic 

acetylcholine receptor, comprising an 02 human neuronal nicotinic 
acetylcholine receptor subunit. 
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44. The neuronal nicotinic acetylcholine receptor of claim 31 , 
further comprising at least one human neuronal nicotinic acetylcholine 
receptor a subunit. 

45. A method for identifying functional neuronal nicotinic 

5 acetylcholine receptor subunits and combinations thereof, comprising: 

(a) introducing a nucleic acid molecule of claim 1 into eukaryotic 
cells; and 

(b) detecting neuronal nicotinic acetylcholine receptor activity in 
the cells of step (a), wherein the activity is mediated by a receptor 

10 containing a subunit encoded by the introduced molecule. 

46. The method of claim 45, further comprising, introducing 
nucleic acid encoding one or more p ox a subunits of a human neuronal 
nicotinic acetylcholine receptor. 

47. A method for identifying functional neuronal nicotinic 

1 5 acetylcholine receptor subunits and combinations thereof, comprising: 

(a) introducing a nucleic acid molecule of claim 1 1 into 
eukaryotic cells; and 

(b) detecting neuronal nicotinic acetylcholine receptor activity in 
the cells of step (a), wherein the activity is mediated by a receptor 

20 containing a subunit encoded by the introduced molecule. 

48. The method of claim 47, further comprising, introducing 
nucleic acid encoding one or more /? or a subunits of a human neuronal 
nicotinic acetylcholine receptor. 

49. The nucleic acid of claim 1 or claim 1 1 that is mRNA. 
25 50. Isolated cells containing the mRNA of claim 49. 

51. Cells of claim 51, further comprising mRNA encoding an 
additional a ox p subunit of a human neuronal nicotinic acetylcholine 
receptor. 
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52. An isolated nucleic acid molecule, comprising nucleotides 
98-211 of SEQ ID NO: 15. 
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